Publication Date
1-1-2024
Journal
Nature Methods
DOI
10.1038/s41592-023-02069-6
PMID
38036856
PMCID
PMC11610747
PubMedCentral® Posted Date
12-2-2024
PubMedCentral® Full Text Version
Author MSS
Published Open-Access
yes
Keywords
Sequence Analysis, DNA, Genome, Genomics, Chromosome Mapping, High-Throughput Nucleotide Sequencing
Abstract
Complete, telomere-to-telomere genome assemblies promise improved analyses and the discovery of new variants, but many essential genomic resources remain associated with older reference genomes. Thus, there is a need to translate genomic features and read alignments between references. Here we describe a new method called levioSAM2 that accounts for reference changes and performs fast and accurate lift-over between assemblies using a whole-genome map. In addition to enabling the use of multiple references, we demonstrate that aligning reads to a high-quality reference (e.g. T2T-CHM13) and lifting to an older reference (e.g. GRCh38) actually improves the accuracy of the resulting variant calls on the old reference. By leveraging the quality improvements of T2T-CHM13, levioSAM2 reduces small-variant calling errors by 11.4-39.5% compared to GRC-based mapping using real Illumina datasets. LevioSAM2 also improves long-read-based structural variant calling and reduces errors from 3.8-11.8% for a PacBio HiFi dataset. Performance is especially improved for a set of complex medically-relevant genes, where the GRC references are lower quality. The software is available at https://github.com/milkschen/leviosam2 under the MIT license.

- Citations
- Citation Indexes: 11
- Usage
- Downloads: 8
- Abstract Views: 1
- Captures
- Readers: 51
- Mentions
- Blog Mentions: 1
- News Mentions: 1
Included in
Biological Phenomena, Cell Phenomena, and Immunity Commons, Biomedical Informatics Commons, Genetics and Genomics Commons, Medical Genetics Commons, Medical Molecular Biology Commons, Medical Specialties Commons