CNVPipe: An enhanced pipeline for accurate analysis of copy number variation from whole-genome sequencing

Avatar
Poster
Voice is AI-generated
Connected to paperThis paper is a preprint and has not been certified by peer review

CNVPipe: An enhanced pipeline for accurate analysis of copy number variation from whole-genome sequencing

Authors

Sun, J.; Wong, N. K.; Jiang, Z.; Luk, H.-M.; Lo, F. M.; Di, D.; Zhang, S.; Wong, R. C.; Jin, W.; Yip, S. P.; Huang, C.-L.

Abstract

Copy number variations (CNVs) are critical contributors to the genetic architecture of complex diseases, yet many existing pipelines for whole-genome sequencing (WGS) data exhibit persistently high false discovery rates (FDR). Here, we introduce CNVPipe, an enhanced workflow that integrates widely used CNV-calling tools with a novel machine-learning framework to achieve lower FDR and higher sensitivity. CNVPipe also provides a specialised module for single-cell CNV analysis, featuring a new model that accurately estimates the ploidy of single cells to get integer copy number profile. Benchmark evaluations demonstrate that CNVPipe outperforms current pipelines in various genomic contexts. In addition to detecting recurrent CNVs in paediatric developmental disorders, CNVPipe enables large-scale functional genomics applications involving stem cell technologies. Moreover, its application to sparse single-cell DNA sequencing data provides a new aspect of research in cancer evolution. Collectively, these findings underscore the versatility and reliability of CNVPipe as a comprehensive solution for CNV detection in WGS-based research.

Follow Us on

0 comments

Add comment