Research

Quantifying whole-chromosome aneuploidies in human IVF embryos

Aneuploidy (gain or loss of whole chromosomes) is the leading cause of pregnancy loss and congenital disorders. Only approximately half of all conceptions survive to live birth, primarily due to aneuploidy.

Using DNA data from a large sample of IVF embryos at day-5 of development, I identified gains and losses of each autosome, as well as the sex chromosomes. We find that each of these error modes is tolerated up to the day 5 stage, and that losses are less frequent on larger, gene-rich chromosomes. Taken together, these findings suggest that the fitness cost of aneuploidy might be due to the cumulative effects of disregulation across numerous genes on a chromosome.

I shared these initial results in a talk at the Biology of Genomes conference in May 2024, and we continue using this dataset to further investigate aneuploidies in early human development.

Testing for transmission distortion in a large sample of human sperm genomes

Mendel’s Law of Segregation states that the offspring of a diploid, heterozygous parent will inherit either allele with equal probability. While the vast majority of loci adhere to this rule, research in model and non-model organisms has uncovered numerous exceptions whereby “selfish” alleles are disproportionately transmitted to the next generation. Evidence of such transmission distortion (TD) in humans remains equivocal in part due to statistical and sample size limitations of past studies.

We leveraged single-cell sequencing data from over 40,000 individual sperm across 25 donors (5 donors with known infertility phenotypes and 20 donors of presumed normal fertility). This scan enabled us to look at an earlier developmental time point than had been considered in previous studies, with the possibility of avoiding the effects of downstream selection and other biases. Our results exhibited close concordance with binomial expectations under balanced transmission, in contrast to tenuous signals of TD that were previously reported in pedigree-based studies.

This work was highlighted in an eLife digest! See our paper and our code, and check Twitter for a summary of our findings!

rhapsodi: an R package to impute sparsely sequenced haploid genomes

The recent development of high-throughput single-cell genome sequencing of human sperm (termed "Sperm-seq") offers an opportunity to study various aspects of meiosis and inheritance with improved statistical power. However, the low sequencing coverage per cell (0.01x) necessitates the development of tailored statistical methods for recovering gamete genotypes.

To this end, we developed a method called rhapsodi (R haploid sperm/oocyte data imputation) that uses low-coverage single-cell DNA sequencing data from large samples of gametes to reconstruct phased donor haplotypes, impute gamete genotypes, and map meiotic recombination events. rhapsodi is introduced in our paper and is available as a user-friendly and thoroughly benchmarked R package.

Science Policy

From 2017 to 2019, I worked as a Science Policy Fellow at the Science and Technology Policy Institute (STPI), a federally-funded research and development center in Washington, DC. STPI receives funding from the NSF to conduct research and projects for the White House Office of Science and Technology Policy (OSTP). STPI also contracts with government agencies across the Executive Branch to offer science and technology analysis. These projects generally included research and literature review, interviews, and some project-specific analyses, followed by briefings, conference presentations, and formal publications. Check out some details about my work here.