Title | Hybrid de novo genome assembly and centromere characterization of the gray mouse lemur (Microcebus murinus). |
Publication Type | Journal Article |
Year of Publication | 2017 |
Authors | Larsen, PA, Harris, RA, Liu, Y, Murali, SC, C Campbell, R, Brown, AD, Sullivan, BA, Shelton, J, Brown, SJ, Raveendran, M, Dudchenko, O, Machol, I, Durand, NC, Shamim, MS, Aiden, ELieberman, Muzny, DM, Gibbs, RA, Yoder, AD, Rogers, J, Worley, KC |
Journal | BMC Biol |
Volume | 15 |
Issue | 1 |
Pagination | 110 |
Date Published | 2017 Nov 16 |
ISSN | 1741-7007 |
Keywords | Animals, Centromere, Cheirogaleidae, Computational Biology, Female, Genome, High-Throughput Nucleotide Sequencing, Sequence Analysis, DNA |
Abstract | BACKGROUND: The de novo assembly of repeat-rich mammalian genomes using only high-throughput short read sequencing data typically results in highly fragmented genome assemblies that limit downstream applications. Here, we present an iterative approach to hybrid de novo genome assembly that incorporates datasets stemming from multiple genomic technologies and methods. We used this approach to improve the gray mouse lemur (Microcebus murinus) genome from early draft status to a near chromosome-scale assembly.METHODS: We used a combination of advanced genomic technologies to iteratively resolve conflicts and super-scaffold the M. murinus genome.RESULTS: We improved the M. murinus genome assembly to a scaffold N50 of 93.32 Mb. Whole genome alignments between our primary super-scaffolds and 23 human chromosomes revealed patterns that are congruent with historical comparative cytogenetic data, thus demonstrating the accuracy of our de novo scaffolding approach and allowing assignment of scaffolds to M. murinus chromosomes. Moreover, we utilized our independent datasets to discover and characterize sequences associated with centromeres across the mouse lemur genome. Quality assessment of the final assembly found 96% of mouse lemur canonical transcripts nearly complete, comparable to other published high-quality reference genome assemblies.CONCLUSIONS: We describe a new assembly of the gray mouse lemur (Microcebus murinus) genome with chromosome-scale scaffolds produced using a hybrid bioinformatic and sequencing approach. The approach is cost effective and produces superior results based on metrics of contiguity and completeness. Our results show that emerging genomic technologies can be used in combination to characterize centromeres of non-model species and to produce accurate de novo chromosome-scale genome assemblies of complex mammalian genomes. |
DOI | 10.1186/s12915-017-0439-6 |
Alternate Journal | BMC Biol |
PubMed ID | 29145861 |
PubMed Central ID | PMC5689209 |
Grant List | P20 GM103418 / GM / NIGMS NIH HHS / United States S10 OD018164 / OD / NIH HHS / United States R01 GM124041 / GM / NIGMS NIH HHS / United States U54 HG003273 / HG / NHGRI NIH HHS / United States U54 HG003273 / HG / NHGRI NIH HHS / United States |
Hybrid de novo genome assembly and centromere characterization of the gray mouse lemur (Microcebus murinus).
Similar Publications
Deep sequencing of candidate genes identified 14 variants associated with smoking abstinence in an ethnically diverse sample. Sci Rep. 2024;14(1):6385. | .
FAIR Header Reference genome: a TRUSTworthy standard. Brief Bioinform. 2024;25(3). | .
Germline Genetic Testing and Survival Outcomes Among Children With Rhabdomyosarcoma: A Report From the Children's Oncology Group. JAMA Netw Open. 2024;7(3):e244170. | .