HGSC Sequenced Eukaryotic Organisms

Green - low coverage
Blue - high quality genome

Projects

The table below shows all genomes sequenced by the HGSC. The main "initial projects" are shown while ongoing efforts to "upgrade" sequences or identify genetic variation are described later. An important contribution of the HGSC is continual development of sequence strategies and software that improve draft assembly quality. After the human, the rat project showcased the "Combined Assembly" method, where BACs were "skimmed" and combined with WGS. We later developed Clone-Array Pooled Shotgun Sequencing (CAPSS) that greatly simplified introduction of BAC components. With the Sea Urchin and Bovine projects, CAPSS integrated with Combined Assembly became a routine production tool.

SpeciesSizeGoalSequencingProject StatusMethods% BCM
Human (H. sapiens) 2.9 GbFinishedCompleteCompleteClone by clone10.5
Mouse (M. musculus) 2.6 GbFinishedCompleteCompleteClone by clone<1.0
Rat (R. norvegicus) 2.8 GbDraft7x CompleteCompleteMixed50
Fly (D. melanogaster) 130 MbFinished10x CompleteCompleteWGS/BAC35
Fly (D. pseudoobscura) 130 MbDraft8x CompleteCompleteWGS100
Fly (D. pmelanogaster -multiple strains) 200 MbDraftIn progressIn progressWGS100
Honey bee (A. mellifera) 238 MbDraft7x CompleteCompleteWGS100
cDNAs (H. sapiens) 12,000FinishedN/ACompleteCCS30
cDNAs (M. musculus) 10,000FinishedN/ACompleteCCS30
Sea urchin (S. purpuratus) 800 MbDraft7x CompleteAnalysisWGS/CAPSS100
Bovine (B. taurus) 2.9 GbDraft7x CompleteAnalysisWGS/CAPSS100
Rhesus monkey (M. mulatta) 2.9 GbDraft5.1x CompleteAnalysisWGS/CAPSS~40
Orangutan (P. pygmaeus) 2.9 GbDraft6x CompleteAssemblyWGS50
Marmoset (C. jacchus) 2.9 GbDraft7x CompleteAssemblyWGS50
Pea aphid (A. pisum) 540 MbDraftIn progressIn progressWGS100
Wasp (Nasonia spp.) 330 MbDraftIn progressIn progressWGS100
Beetle (T. castaneum) 155 MbDraft7x CompleteAnalysisWGS100
Wallaby (M. eugenii) 3.6 Gb2x DraftIn progressIn progressWGS50
Acorn worm (S. kowalevskii) 1.1 GbDraftIn progressIn progressWGS100
Hyrax (P. capensis) 2.9 Gb2xIn progressIn progressWGS100
Megabat (P. vampyrus) 2.9 Gb2x DraftIn progressIn progressWGS100
Armadillo (D. novemcinctus) 3 GbDraft6xIn progressWGS66
Baboon (P. hamadryas) 3 GbDraft6xIn progressWGS100
Bumble bee (B. terrestris) 300 MbDraft15xIn progressWGS100
California mouse (P. californicus) 3 GbDraft2xIn progressWGS100
Centipede (S. maritima) 300 MbDraft6xIn progressWGS100
Cotton bollworm (H. armigera) 500 MbDraft15xIn progressWGS100
Deer mouse (P. maniculatus) 3 GbDraft6xIn progressWGS100
Gibbon, Northern white-cheeked (N. leucogenys) 3 GbDraft6xIn progressWGS50
Dolphin, Bottlenose (T. truncatus) Draft2.5xIn progressWGS100
Dwarf honey bee (A. florea) 300 MbDraft15xIn progressWGS100
Hessian fly (M. destructor) 200 MbDraft15xIn progressWGS100
Kangaroo rat (D. ordii) 3 GbDraft2xIn progressWGS100
Lemur, Gray Mouse (M. Murinus) 3 GbDraft6xIn progressWGS66
Oldfield mouse (P. polionotus) 3 GbDraft2xIn progressWGS100
Sandfly (L. longipalpis) 300 MbDraft15xIn progressWGS100
White-footed mouse (P. leucopus) 3 GbDraft2xIn progressWGS100

Table: Genomes at the BCM-HGSC, past and present. Finished = Bermuda standard; Draft = deep draft coverage >6x of raw data; Mixed = WGS + BAC clone approach; WGS = whole genome shotgun; CCS = Concatenated cDNA Sequencing; CAPSS = Clone Array Pool Shotgun Sequencing; PGI = Pooled Genome Indexing.

Human

Human

The Human Genome Project (HGP) was an international effort to sequence and annotate the entire estimated 3.3 billion bases of the human genome. The project was conceived in the mid-80s, and it began in 1990.

The HGP effort at HGSC was completed in 2006, with chromosomes 3, 12 and 30 Mb of X (~10% of the genome).

Genome Data