Hello and welcome! I’m Trang Le. I’m a data scientist at Bristol Myers Squibb. I enjoy developing machine learning methods for analyses of biomedical data, including neuroimaging (functional/structural MRI), transcriptomics and genotypes. Most of the datasets I work with are high-dimensional (i.e., have many predictors/features), so I spend most of my time building feature selection algorithms for these data. I trade my bias toward the nearest-neighbor concept for lower variance of my methods and better generalizability. When I’m not knee-deep in data, I run, dance and seasonally ski.

Explorations

10 things I love about Julia

December 28 2021

Those who know me know that my love for the Julia language has grown quite a bit in the past couple of years. Still, I had trouble finding a project that would allow me to work more in this nifty language until I saw Jasmine Hughes solve all of last year’s Advent of Code (AOC) puzzles in Julia. I know of Jasmine through the AOC RLadies leaderboard! See Jasmine’s valuable advice on “how to get …

Read More…

As I work closely with graduate students (whom I fondly call my apprentices), I share with them some tools I picked up along the way that boost my productivity and make my life easier in general. Often, they come back with “WOW!!! XYZ has been really helpful! I wish I knew about it 5 years ago.” So do I. Well, some of these tools might have been at an early stage or not even existed …

Read More…

In recent weeks, I have read a number of articles about racism in America, mostly based on individuals’ personal experiences. Until very recently, I still looked at this racism from a very narrow angle: the individual one. Trang Quạ writes: “this discrimination is deeply ingrained in the subconscious of individuals in the community.” I agree. But the issue of race in America is bigger than any individual …

Read More…

It took me 30 minutes to figure this out. I hope it takes you less. Earlier today, I submitted a manuscript to the GECCO Hot-Off-the-Press track. The submission process was pretty straightforward until I hit Submit and encountered this error: […] All fonts must be embedded in the PDF. […] Googling this error led me to fiddle around with Acrobat Reader, try different TeX engines and …

Read More…

Recent Works

  • treeheatr and pmlbr: visualizing decision trees on benchmark datasets. R-Ladies Johannesburg, Sep 14, 2021
  • Take a bad chart and make it better. IMS, Aug 30, 2021
  • Take a bad chart and make it better. Cleveland R User Group, Aug 25, 2021
  • On visualization: Take a sad chart and make it better. R-Ladies Philly, Dec 8, 2020
  • tdapseudotime: Implements the temporal phenotyping via topological data analysis. (2020)
  • Visualizing decision trees on benchmark datasets. R-Ladies Miami, Nov 19, 2020
  • regens: REGENS (REcombinatory Genome ENumeration of Subpopulations) (2020)
  • pmlbr: an R interface to PMLB (2020)