Improved sequence mapping using a complete reference genome and lift-over.

TitleImproved sequence mapping using a complete reference genome and lift-over.
Publication TypeJournal Article
Year of Publication2024
AuthorsChen, N-C, Paulin, LF, Sedlazeck, FJ, Koren, S, Phillippy, AM, Langmead, B
JournalNat Methods
Volume21
Issue1
Pagination41-49
Date Published2024 Jan
ISSN1548-7105
KeywordsChromosome Mapping, Genome, Genomics, High-Throughput Nucleotide Sequencing, Sequence Analysis, DNA
Abstract

Complete, telomere-to-telomere (T2T) genome assemblies promise improved analyses and the discovery of new variants, but many essential genomic resources remain associated with older reference genomes. Thus, there is a need to translate genomic features and read alignments between references. Here we describe a method called levioSAM2 that performs fast and accurate lift-over between assemblies using a whole-genome map. In addition to enabling the use of several references, we demonstrate that aligning reads to a high-quality reference (for example, T2T-CHM13) and lifting to an older reference (for example, Genome reference Consortium (GRC)h38) improves the accuracy of the resulting variant calls on the old reference. By leveraging the quality improvements of T2T-CHM13, levioSAM2 reduces small and structural variant calling errors compared with GRC-based mapping using real short- and long-read datasets. Performance is especially improved for a set of complex medically relevant genes, where the GRC references are lower quality.

DOI10.1038/s41592-023-02069-6
Alternate JournalNat Methods
PubMed ID38036856
PubMed Central ID5411779
Grant ListR01 HG011392 / HG / NHGRI NIH HHS / United States
R35 GM139602 / GM / NIGMS NIH HHS / United States

Similar Publications