A blended genome and exome sequencing method captures genetic variation in an unbiased, high-quality, and cost-effective manner

Avatar
Poster
Voice is AI-generated
Connected to paperThis paper is a preprint and has not been certified by peer review

A blended genome and exome sequencing method captures genetic variation in an unbiased, high-quality, and cost-effective manner

Authors

Boltz, T. A.; Chu, B. B.; Liao, C.; Sealock, J. M.; Ye, R.; Majara, L.; Fu, J. M.; Service, S.; Zhan, L.; Medland, S. E.; Chapman, S. B.; Rubinacci, S.; DeFelice, M.; Grimsby, J. L.; Abebe, T.; Alemayehu, M.; Ashaba, F. K.; Atkinson, E. G.; Bigdeli, T.; Bradway, A. B.; Brand, H.; Chibnik, L. B.; Fekadu, A.; Gatzen, M.; Gelaye, B.; Gichuru, S.; Gildea, M. L.; Hill, T. C.; Huang, H.; Hubbard, K. M.; Injera, W. E.; James, R.; Joloba, M.; Kachulis, C.; Kalmbach, P. R.; Kamulegeya, R.; Kigen, G.; Kim, S.; Koen, N.; Kwobah, E. K.; Kyebuzibwa, J.; Lee, S.; Lennon, N. J.; Lind, P. A.; Lopera-Maya, E.

Abstract

We deployed the Blended Genome Exome (BGE), a DNA library blending approach that generates low pass whole genome (1-4x mean depth) and deep whole exome (30-40x mean depth) data in a single sequencing run. This technology is cost-effective, empowers most genomic discoveries possible with deep whole genome sequencing, and provides an unbiased method to capture the diversity of common SNP variation across the globe. To evaluate this new technology at scale, we applied BGE to sequence >53,000 samples from the Populations Underrepresented in Mental Illness Associations Studies (PUMAS) Project, which included participants across African, African American, and Latin American populations. We evaluated the accuracy of BGE imputed genotypes against raw genotype calls from the Illumina Global Screening Array. All PUMAS cohorts had R2 concordance >=95% among SNPs with MAF>=1%, and never fell below >=90% R2 for SNPs with MAF<1%. Furthermore, concordance rates among local ancestries within two recently admixed cohorts were consistent among SNPs with MAF>=1%, with only minor deviations in SNPs with MAF<1%. We also benchmarked the discovery capacity of BGE to access protein-coding copy number variants (CNVs) against deep whole genome data, finding that deletions and duplications spanning at least 3 exons had a positive predicted value of ~90%. Our results demonstrate BGE scalability and efficacy in capturing SNPs, indels, and CNVs in the human genome at 28% of the cost of deep whole-genome sequencing. BGE is poised to enhance access to genomic testing and empower genomic discoveries, particularly in underrepresented populations.

Follow Us on

0 comments

Add comment