Language
English
Publication Date
1-1-2024
Journal
Nature Methods
DOI
10.1038/s41592-023-02069-6
PMID
38036856
PMCID
PMC11610747
PubMedCentral® Posted Date
12-2-2024
PubMedCentral® Full Text Version
Author MSS
Abstract
Complete, telomere-to-telomere genome assemblies promise improved analyses and the discovery of new variants, but many essential genomic resources remain associated with older reference genomes. Thus, there is a need to translate genomic features and read alignments between references. Here we describe a new method called levioSAM2 that accounts for reference changes and performs fast and accurate lift-over between assemblies using a whole-genome map. In addition to enabling the use of multiple references, we demonstrate that aligning reads to a high-quality reference (e.g. T2T-CHM13) and lifting to an older reference (e.g. GRCh38) actually improves the accuracy of the resulting variant calls on the old reference. By leveraging the quality improvements of T2T-CHM13, levioSAM2 reduces small-variant calling errors by 11.4-39.5% compared to GRC-based mapping using real Illumina datasets. LevioSAM2 also improves long-read-based structural variant calling and reduces errors from 3.8-11.8% for a PacBio HiFi dataset. Performance is especially improved for a set of complex medically-relevant genes, where the GRC references are lower quality. The software is available at https://github.com/milkschen/leviosam2 under the MIT license.
Keywords
Sequence Analysis, DNA, Genome, Genomics, Chromosome Mapping, High-Throughput Nucleotide Sequencing
Published Open-Access
yes
Recommended Citation
Chen, Nae-Chyun; Paulin, Luis F; Sedlazeck, Fritz J; et al., "Improved Sequence Mapping Using a Complete Reference Genome and Lift-Over" (2024). Faculty and Staff Publications. 2302.
https://digitalcommons.library.tmc.edu/baylor_docs/2302
Included in
Biological Phenomena, Cell Phenomena, and Immunity Commons, Biomedical Informatics Commons, Genetics and Genomics Commons, Medical Genetics Commons, Medical Molecular Biology Commons, Medical Specialties Commons