Email · LinkedIn · Google Scholar · GitHub
Hi there! I'm a first-year Computer Science PhD student at Princeton University supported by the NSF CSGrad4US fellowship. I'm broadly interested in developing statistical and machine learning methods to help advance precision medicine, especially for cancer.
Formerly, I was a computational associate at the Broad Institute of MIT and Harvard and Massachusetts General Hospital where I was really lucky to be co-mentored by Profs. Gad Getz and Esther Rheinbay. Before that, I studied Computational Biology and Engineering at Brown University, where I am very grateful to have been advised by Prof. Ritambhara Singh.
SVelfie (Structural Variants enriched with likely functional/impactful events, pronounced "SVEL-fee") is a statistical method to infer structural variant (SV) driver genes in cancer genome cohorts by analyzing the enrichment of likely functional effects based off breakpoint arrangements. This is an alternative approach to traditional approaches focusing on SV breakpoint frequencies, and it can potentially be used to complement such traditional methods. SVelfie recapitulates known drivers and yields novel candidates, and was applied in a Nature Genetics publication (see below).
This study presents a comprehensive genomic analysis of multiple myeloma (MM) and its precursor stages, identifying candidate drivers from mutations, copy number alterations, and structural variants from 1,030 patients. Using these findings, an “MM-like” score was developed that predicts disease progression, risk classification, and subclonal dynamics, offering insights to improve early intervention and monitoring strategies. The method I developed, SVelfie (see one above) was used to infer structural variant drivers integrated into this MM-like score, and we also found specific mutational aetiologies associated with such SV candidate drivers (see above graphic).
This paper develops machine learning models to classify new cancer samples into predefined TCGA molecular subtypes using multi-omic data from 8,791 tumor samples across 26 cancer cohorts. By validating the models with external datasets and providing containerized versions of the top-performing classifiers, this work attemps to enable more accessible and accurate application of molecular subtyping in clinical and research settings.
Introduces GC-MERGE, a graph convolutional model that integrates long-range genomic interactions and local regulatory factors (specifically histone modifications) to predict gene expression while capturing the 3D structure of the genome. By applying the model to multiple cell lines, the study demonstrates state-of-the-art predictive performance and interprets the biological mechanisms behind its predictions, offering potential insights into the local and long-range regulatory factors influencing gene expression.
Introduces XL-MERGE, a modified version of GC-MERGE with three different mechanisms for improvement, outperforming GC-MERGE for all three tested cell lines in terms of AUROC.
I love mentoring students and building both their enthusiasm and skills in research and academics! I have mentored three undergraduate students in Getz Lab, and used to be a classroom leader for the after-school volunteering program Buddies4Math.
Additionally, I was part of the steering committee for RATalks, a group at the Broad Institute that hosts talks to the greater community. For this, I conceptualized and led planning of our very first alumni RATalk which sought to disseminate career and life perspective to the early-researcher community by a former associate director of the Broad Institute, increasing the availability of mentorship to junior researchers!
I love to read and average around 30 books per year. What I love most about reading is how it enables better understanding of those around us and their perspectives. I think this is especially important in research, where this can help me to be a better collaborator and also to create a more welcoming environment to others from diverse backgrounds. Some books that have inspired me:
Due in part to the perspectives I gained from my reading, I also wrote an essay about how to increase diversity in computer science-related disciplines, which incidentally helped earn me $159,000 of funding for my PhD via the NSF's CSGrad4US fellowship.