Xavi Loinaz

Email · LinkedIn · Google Scholar · GitHub


Hi there! I'm an associate computational biologist at the Broad Institute of MIT and Harvard and Massachusetts General Hospital where I'm really grateful to be co-mentored by Profs. Gad Getz and Esther Rheinbay. My work currently focuses on computational cancer genomics, with a special focus on structural variation. Previously, I studied Computational Biology and Engineering at Brown, where I'm very lucky to have been advised by Prof. Ritambhara Singh.

I'm also currently applying for PhD programs where I have the support of the NSF CSGrad4US fellowship ($159,000 total)!



Selected Research


SSVGAR: A computational method to discover candidate structural variant driver genes based on cohorts of tumor genomes
Xavi Loinaz, Johnathan Dagan, Chip Stewart, Jean-Baptiste Alberge, Julian Hess, Esther Rheinbay, Gad Getz
Poster for Broad Institute Scientific Retreat, December 2023; in preparation for initial submission to Nature Methods with preprint imminent for bioRxiv SSVGAR (Significant Structural Variant Genic Association Revelation, pronounced "sugar") is a computational method to infer structural variant (SV) driver genes in cancer genome cohorts by analyzing the enrichment of putative functional effects based off breakpoint arrangements for specific genes. This is an alternative approach to traditional approaches focusing on SV breakpoint frequencies, and it can potentially be used to complement such traditional methods in the future. SSVGAR has demonstrated efficacy in recapitulating known drivers and discovering novel candidates, and was applied in a Nature Genetics paper that was accepted in principle (see two publications below).


Genomic landscape of multiple myeloma and of its precursor conditions
Jean-Baptiste Alberge*, Ankit K. Dutta*, Andrea Poletti*, Tim H. H. Coorens, Elizabeth D. Lightbody, Rosa Toenges, Xavi Loinaz, Sofia Wallin, Andrew Dunford, Oliver Priebe, Cody J. Boehner, Erica Horowitz, Nang K. Su, Hadley Barr, Laura Hevenor, Katherine Towle, Rashmika Beesam, Jenna B. Beckwith, Jacqueline Perry, David M. Cordas dos Santos, Luca Bertamini, Patricia T. Greipp, Kirsten Kübler, Peter F. Arndt, Carolina Terragna, Elena Zamagni, Eileen M. Boyle, Kwee Yong, Meletios Athanasios Dimopoulos, Efstathios Kastritis, Julian Hess, Romanos Sklavenitis-Pistofidis, Chip Stewart, Gad Getz**, Irene M. Ghobrial**
Oral presentation at ASH, December 2024; accepted in principle to Nature Genetics
[Poster abstract at ASH 2024]
[Poster abstract at ASH 2023] This study presents a comprehensive genomic analysis of multiple myeloma (MM) and its precursor stages, identifying candidate drivers from mutations, copy number alterations, and structural variants from 1,030 patients. Using these findings, an “MM-like” score was developed that predicts disease progression, risk classification, and subclonal dynamics, offering insights to improve early intervention and monitoring strategies. The method I developed, SSVGAR (see one above) was used to infer structural variant drivers integrated into this MM-like score, and we also found specific mutational aetiologies associated with such SV candidate drivers (see above graphic).


Classification of non-TCGA Cancer Samples to TCGA Molecular Subtypes Using Compact Feature Sets
Kyle Ellrott*, Christopher K. Wong*, Christina Yau*, Mauro A. A. Castro*, Jordan A. Lee*, Brian J. Karlberg*, Jasleen K. Grewal*, Vincenzo Lagani*, Bahar Tercan*, Verena Friedl, Toshinori Hinoue, Vladislav Uzunangelov, Lindsay Westlake, Xavi Loinaz, Ina Felau, Peggy I. Wang, Anab Kemal, Samantha J. Caesar-Johnson, Ilya Shmulevich, Alexander J. Lazar, Ioannis Tsamardinos, Katherine A. Hoadley, The Cancer Genome Atlas Analysis Network, A. Gordon Robertson, Theo A. Knijnenburg, Christopher C. Benz, Joshua M. Stuart, Jean C. Zenklusen, Andrew D. Cherniack**, Peter W. Laird**
Published in Cancer Cell, January 2025
[Oral abstract at AACR 2024] This paper develops machine learning models to classify new cancer samples into predefined TCGA molecular subtypes using multi-omic data from 8,791 tumor samples across 26 cancer cohorts. By validating the models with external datasets and providing containerized versions of the top-performing classifiers, this work attemps to enable more accessible and accurate application of molecular subtyping in clinical and research settings.


Integrating Long-Range Regulatory Interactions to Predict Gene ExPression Using Graph Convolutional Networks
Jeremy Bigness, Xavi Loinaz, Shalin Patel, Erica Larschan, Ritambhara Singh
Published in the Journal of Computational Biology, May 2022 Introduces GC-MERGE, a graph convolutional model that integrates long-range genomic interactions and local regulatory factors (specifically histone modifications) to predict gene expression while capturing the 3D structure of the genome. By applying the model to multiple cell lines, the study demonstrates state-of-the-art predictive performance and interprets the biological mechanisms behind its predictions, offering potential insights into the local and long-range regulatory factors influencing gene expression.


Exploring Graph-Based Neural Networks for Modeling Long-Range Epigenetic Gene Regulation
Xavi Loinaz (advised by Ritambhara Singh)
Brown University Honors Senior Thesis, May 2021
[Presentation for Brown CS Research Symposium] Introduces XL-MERGE, a modified version of GC-MERGE with three different mechanisms for improvement, outperforming GC-MERGE for all three tested cell lines in terms of AUROC.


Polymeric Nanoparticle Surface Coatings for Biomedical Applications
Xavi Loinaz, Xiaoting Guo, Jake Villanova, Hyewon Kim, Vicki Colvin
Poster for Brown Undergraduate Summer Research Symposium, August 2019 I worked on two different research projects: one observing protein crystallization in the presence of gold nanoparticles, and one on creating gadolinium-oxide MRI contrast agents for characterizing glioma. For the latter, I determined a way to synthesize and demonstrate desired PAMPS-LA polymer surface coating of desired size, an issue that had been unresolved in the lab for approximately a year.


A Brief Analysis of Predictive Pitching Metrics
Xavi Loinaz
Published on FanGraphs Community Research Blog, January 2018 A brief article I wrote on evaluating different Sabermetric pitching statistics for predicting future pitcher performance.


The Design of Visual Recognition Software to Analyze Deep Brain Stimulation Video
Xavi Loinaz, Eric Sabelman
Poster for Advanced Authentic Research Showcase, May 2017 Back in high school, I was a research intern in the neurosurgery department of Kaiser Permanente at Redwood City, California. I developed software to automatically parse video of deep brain stimulation based off the importance of certain parts of the surgical procedure. The image above is a snpashot of the type of video content I worked with.


Selected Other Projects


Characterization of somatic events for diffuse large B-cell lymphoma in the Clinical Trial Sequencing Project
As the Getz Lab's lead analyst of the NIH-funded Clinical Trial Sequencing Project (CTSP) on diffuse large B-cell lymphoma, I fixed pipelines and analyzed somatic mutations, indels, copy number alterations, structural variants, and mutational signatures in 138 diffuse large B-cell lymphoma (DLBCL) samples. Had poster presentation for 2021 Broad Retreat Scientific Retreat; content is unable to be posted due to consortium rules. Project is progressing towards a manuscript.

Characterization and concordance analysis for cell lines of the Human Cancer Models Initiative
Acted as the Getz Lab's lead analyst for the NIH-funded Human Cancer Models Initiative to develop next-generation cancer models that better represent the diversity and complexity of human cancers. I focused on analysis relating to structural variation and quality control, as well as concordance of mutational signatures between tumor samples and corresponding models. Anticipated submission for flagship manuscript is by the end of January 2025.

Getz Lab's Structural Variant-Calling Pipeline
I rewrote and maintain the tumor structural variant-calling pipeline of the Getz Lab (50+-person lab). I also implemented a scatter/gather parallelization of the SV-calling tool SvABA, reducing SvABA’s runtime by a factor of ~10, and additionally have patched several bugs in the pipeline.

Neurophysiological Experiments Data Synchronization System
For spring semester of my junior year in college, I had an independent study research project where I developed a proof-of-concept system for data synchronization between video and various electrophysiological sensors for studies in Prof. David Borton's lab at Brown. I read in and decoded LTC signals from Timecode Systems devices on an Arduino and wrote software to test synchronization. I also resolved a problem in drift between GoPro and behavioral sensor data that had affected one of the lab's projects and had been unresolved for about a year.

Are.na Recommendation System
For our final project in Brown's upper-level Data Science (CSCI 1951A) course, we developed a recommendation system for the art-sharing website Are.na using singular value decomposition.

Histological Staining Quantification
In Prof. Jill Helms' lab at Stanford as a rising college sophomore, upon seeing how long it took to manually quantify amount of histological staining on microscope slides, I created software using OpenCV to automatically quantify staining using color thresholds.

TheGunnApp
Back in high school, I conceptualized and led the development and maintenance of an official school-sanctioned iOS app with useful features for my high school's community. It eventually got over 5,000 downloads and was regularly used by students, parents, and faculty. This was my first formative experience in seeing how immediate of an impact computational technologies I could develop could have on the people around me!
[Press]


Mentorship and Outreach


I love mentoring students and building both their enthusiasm and skills in research and academics! I have mentored three undergraduate students in Getz Lab, and used to be a classroom leader for the after-school volunteering program Buddies4Math.

Image Example
Me with Johnathan at the poster showcase
Me with my undergraduate intern, Johnathan, at the Getz Lab's end-of-summer intern poster showcase!

Additionally, I was part of the steering committee for RATalks, a group at the Broad Institute that hosts talks to the greater community. For this, I conceptualized and led planning of our very first alumni RATalk which sought to disseminate career and life perspective to the early-researcher community by a former associate director of the Broad Institute, increasing the availability of mentorship to junior researchers!


Reading and Writing


I love to read and average around 30 books per year. What I love most about reading is how it enables better understanding of those around us and their perspectives. I think this is especially important in research, where this can help me to be a better collaborator and also to create a more welcoming environment to others from diverse backgrounds. Some books that have inspired me:

Due in part to the perspectives I gained from my reading, I also wrote an essay about how to increase diversity in computer science-related disciplines, which incidentally helped earn me $159,000 of funding for my PhD via the NSF's CSGrad4US fellowship.


This website's design was based on a template by Shouvik Mani.