Publication Date

9-11-2023

Journal

Genome Biology

DOI

10.1186/s13059-023-03022-8

PMID

37697406

PMCID

PMC10496407

PubMedCentral® Posted Date

9-11-2023

PubMedCentral® Full Text Version

Post-print

Published Open-Access

yes

Keywords

Humans, Animals, Hominidae, Segmental Duplications, Genomic, Telomere, Genomics, Chromosomes, Human, De-novo assembly, Segmental duplications, Long-read PacBio sequencing, Chromosomal fusion, Complex genomic rearrangements

Abstract

Resolving complex genomic regions rich in segmental duplications (SDs) is challenging due to the high error rate of long-read sequencing. Here, we describe a targeted approach with a novel genome assembler PhaseDancer that extends SD-rich regions of interest iteratively. We validate its robustness and efficiency using a golden-standard set of human BAC clones and in silico-generated SDs with predefined evolutionary scenarios. PhaseDancer enables extension of the incomplete complex SD-rich subtelomeric regions of Great Ape chromosomes orthologous to the human chromosome 2 (HSA2) fusion site, informing a model of HSA2 formation and unravelling the evolution of human and Great Ape genomes.

Share

COinS
 
 

To view the content in your browser, please download Adobe Reader or, alternately,
you may Download the file to your hard drive.

NOTE: The latest versions of Adobe Reader do not support viewing PDF files within Firefox on Mac OS and if you are using a modern (Intel) Mac, there is no official plugin for viewing PDF files within the browser window.