Structured Pooling Improves Detection of Rare Regulatory Mutations in Population-Scale Reporter Assays

Avatar
Poster
Voice is AI-generated
Connected to paperThis paper is a preprint and has not been certified by peer review

Structured Pooling Improves Detection of Rare Regulatory Mutations in Population-Scale Reporter Assays

Authors

Dura, K.; Siklenka, K.; Strouse, K. P.; Morrow, S.; Zhang, C.; Barrera, A.; Allen, A. S.; Reddy, T. E.; Majoros, W. H.

Abstract

Identifying genetic variants in noncoding DNA that impact gene expression and thereby contribute to disease risk remains a difficult but important challenge in genomic medicine. Modern reporter assays such as STARR-seq and MPRA provide an efficient and effective means of testing, in very high throughput, millions of variants captured directly from patient genomes. While these assays have previously been scaled to whole genomes and, separately, to populations, we report findings from the first whole-genome population-scale STARR-seq experiment performed on 100 individuals. In order to achieve that scale we devised a novel experimental design that partitions samples into pools so as to increase allele frequencies within pools and thereby reduce expected dropout and increase signal-to-noise ratio in experimental readouts. We show that this design produces more accurate estimates of variant effect sizes, and we provide a Bayesian model for robust estimation of those effect sizes that also reports full posterior distributions for assessment of confidence in estimates. Together, these methodological innovations facilitate the detection of functional regulatory variants, particularly rare variants, with much higher accuracy and at greater scale than previously possible. We demonstrate the utility of this approach on the task of functional annotation of quantitative trait loci such as eQTLs and caQTLs, and show concordance with patterns of constraint in transcription factor binding profiles.

Follow Us on

0 comments

Add comment