Hello and welcome! I’m Trang Le. I’m a postdoctoral fellow with Jason Moore at the Computational Genetics Lab, University of Pennsylvania. I enjoy developing machine learning methods for analyses of biomedical data, including neuroimage (functional/structural MRI), transcriptomics and genotypes. Most of the datasets I work with are high dimensional (i.e., have many predictors/features), so I spend most of my time building feature selection algorithms for these data. I trade my bias toward the nearest-neighbor concept for lower variance of my methods and better generalizability. When I’m not knee deep in data, I run, dance and seasonally ski.

Explorations

As I work closely with graduate students (whom I dearly call my apprentices), I share with them some tools I picked up along the way that help boost my productivity and just make my life easier in general. Often, they come back with “WOW!!! XYZ has been really helpful! I wish I knew about it 5 years ago.” So do I. Well, some of these tools might have been at their early stage or not even existed …

Read More…

Những tuần gần đây, mình đọc một số bài viết về nạn phân biệt chủng tộc ở Mĩ, chủ yếu là từ trải nghiệm cá nhân của từng người. Cho đến rất gần đây, mình vẫn nhìn việc phân biệt chủng tộc này từ một góc rất hạn hẹp: góc độ cá nhân. Bạn Trang Quạ viết: “sự kì thị này ăn sâu vào tiềm thức của các cá thể trong cộng đồng”. Mình đồng ý. Nhưng vấn đề chủng tộc ở Mĩ lớn hơn từng cá thể rất …

Read More…

It took me 30 minutes to figure this out. I hope it takes you less. Earlier today, I submitted a manuscript to GECCO Hot-Off-the-Press track. The submission process was pretty straightforward, until I hit Submit and encountered this error: […] All fonts must be embedded in the PDF. […] Googling this error led me to fiddle around with Acrobat Reader, try different TeX engines and …

Read More…

A few days after nonessential business closing due to the COVID-19 pandemic, the streets and trails of Philadelphia are filled with runners. While it’s nice that a lot of people have reverted to this basic form of exercise, Welcome to the club! it pains us regular runners physically when we see you run in jeans and cotton t-shirts. If running for you is a outdoor family activity and the goal is …

Read More…

 

Recent Works

  • treeheatr and pmlbr: visualizing decision trees on benchmark datasets. R-Ladies Johannesburg, Sep 14, 2021      
  • Take a bad chart and make it better. IMS, Aug 30, 2021      
  • Take a bad chart and make it better. Cleveland R User Group, Aug 25, 2021      
  • On visualization: Take a sad chart and make it better, R Ladies Philly, Dec 8, 2020      
  • Visualizing decision trees on benchmark datasets. R Ladies Miami, Nov 19, 2020      
  • tdapseudotime: Implements the temporal phenotyping via topological data analysis. (2020)    
  • regens: REGENS (REcombinatory Genome ENumeration of Subpopulations) (2020)    
  • pmlbr: an R interface to PMLB (2020)