Language
English
Publication Date
12-31-2025
Journal
Genome Medicine
DOI
10.1186/s13073-025-01593-8
PMID
41470026
Abstract
Background: Copy number variation (CNV) is a class of genomic structural variation (SV) that contributes to genomic disorders and can significantly impact health. Short-read genome sequencing (sr-GS) enables genome-wide SV calling which has been shown to increase diagnosis in unsolved rare disease families. The growing number of large sequencing cohort projects with sr-GS data available requires open free analytical tools that provide visualization of CNV and SV integrated calls associated with gene annotation, proband-parent trio analysis to enable prioritization of de novo variants, B-allele frequency (BAF) plots to support CNV calls, parent of origin assessment and mosaicism detection.
Methods: To support those needs, we developed VizCNV, an open-source platform that incorporates read depth and BAF to enable haplotype-aware CNV analysis. The tool incorporates multiple interactive view modes for SV concurrent calls and annotation tracks for analyzing chromosomal abnormalities [e.g., aneuploidy, segmental aneusomy, and chromosome translocations], gene exonic rearrangements and non-coding gene regulatory regions. In addition, VizCNV includes a built-in filter schema for trio genomes, prioritizing the detection of de novo CNVs. We optimized VizCNV using 1000 Genomes Project data and benchmarked its performance against a cohort containing CNVs validated by multiple technologies. Finally, we applied VizCNV to a molecularly unsolved primary immunodeficiency disease cohort (PIDD, n = 39) previously analyzed by exome sequencing.
Results: Upon computational optimization, VizCNV achieved approximately 82.3% recall and 76.3% precision for deletions > 10 kb. VizCNV accurately detected all 71 validated copy number gains and correctly indicated potential underlying genomic complexities. Haplotype-aware CNV analysis identified a meiosis I non-disjunction event (trisomy 21), three de novo CNVs at two unique loci and 48 inherited candidate CNVs in the PIDD cohort of which 42% (20/48) were validated by integrated CNV/BAF analysis. Moreover, genotype-phenotype analyses revealed that a compound heterozygous combination of a paternal 12.8 kb deletion of exon 5 and a maternal missense variant allele of DOCK8 are the molecular cause of one proband diagnosed with Hyper-IgE syndrome.
Conclusions: VizCNV provides a robust and flexible platform for identification of aneuploidies, CNV, SV discovery and visualization of CNV and BAF data. It is also a useful tool to investigate features of genomic rearrangements such as parental origin which has implications for genetic counseling and mechanistic studies. The tool is freely available through https://doi.org/10.6084/m9.figshare.25869523.
Keywords
Deletions, Duplications, Genomic disorders, Mendelian diseases, Runs of homozygosity, Structural variation, Trisomy 21
Published Open-Access
yes
Recommended Citation
Du, Haowei; Lun, Ming Yin; Gagarina, Lidiia; et al., "An Integrated Platform for Concurrent Structural and Single-Nucleotide Variants Improves Copy-Number Detection and Reveals Pathogenic Alleles in Undiagnosed Mendelian Families" (2025). Faculty, Staff and Students Publications. 6233.
https://digitalcommons.library.tmc.edu/baylor_docs/6233