HGSC Sequenced Eukaryotic Organisms
Green - low coverage
Blue - high quality genome
Projects
The table below shows all genomes sequenced by the HGSC. The main "initial projects" are shown while ongoing efforts to "upgrade" sequences or identify genetic variation are described later. An important contribution of the HGSC is continual development of sequence strategies and software that improve draft assembly quality. After the human, the rat project showcased the "Combined Assembly" method, where BACs were "skimmed" and combined with WGS. We later developed Clone-Array Pooled Shotgun Sequencing (CAPSS) that greatly simplified introduction of BAC components. With the Sea Urchin and Bovine projects, CAPSS integrated with Combined Assembly became a routine production tool.
| Species | Size | Goal | Sequencing | Project Status | Methods | % BCM |
|---|---|---|---|---|---|---|
| Human (H. sapiens) | 2.9 Gb | Finished | Complete | Complete | Clone by clone | 10.5 |
| Mouse (M. musculus) | 2.6 Gb | Finished | Complete | Complete | Clone by clone | <1.0 |
| Rat (R. norvegicus) | 2.8 Gb | Draft | 7x Complete | Complete | Mixed | 50 |
| Fly (D. melanogaster) | 130 Mb | Finished | 10x Complete | Complete | WGS/BAC | 35 |
| Fly (D. pseudoobscura) | 130 Mb | Draft | 8x Complete | Complete | WGS | 100 |
| Fly (D. pmelanogaster -multiple strains) | 200 Mb | Draft | In progress | In progress | WGS | 100 |
| Honey bee (A. mellifera) | 238 Mb | Draft | 7x Complete | Complete | WGS | 100 |
| cDNAs (H. sapiens) | 12,000 | Finished | N/A | Complete | CCS | 30 |
| cDNAs (M. musculus) | 10,000 | Finished | N/A | Complete | CCS | 30 |
| Sea urchin (S. purpuratus) | 800 Mb | Draft | 7x Complete | Analysis | WGS/CAPSS | 100 |
| Bovine (B. taurus) | 2.9 Gb | Draft | 7x Complete | Analysis | WGS/CAPSS | 100 |
| Rhesus monkey (M. mulatta) | 2.9 Gb | Draft | 5.1x Complete | Analysis | WGS/CAPSS | ~40 |
| Orangutan (P. pygmaeus) | 2.9 Gb | Draft | 6x Complete | Assembly | WGS | 50 |
| Marmoset (C. jacchus) | 2.9 Gb | Draft | 7x Complete | Assembly | WGS | 50 |
| Pea aphid (A. pisum) | 540 Mb | Draft | In progress | In progress | WGS | 100 |
| Wasp (Nasonia spp.) | 330 Mb | Draft | In progress | In progress | WGS | 100 |
| Beetle (T. castaneum) | 155 Mb | Draft | 7x Complete | Analysis | WGS | 100 |
| Wallaby (M. eugenii) | 3.6 Gb | 2x Draft | In progress | In progress | WGS | 50 |
| Acorn worm (S. kowalevskii) | 1.1 Gb | Draft | In progress | In progress | WGS | 100 |
| Hyrax (P. capensis) | 2.9 Gb | 2x | In progress | In progress | WGS | 100 |
| Megabat (P. vampyrus) | 2.9 Gb | 2x Draft | In progress | In progress | WGS | 100 |
| Armadillo (D. novemcinctus) | 3 Gb | Draft | 6x | In progress | WGS | 66 |
| Baboon (P. hamadryas) | 3 Gb | Draft | 6x | In progress | WGS | 100 |
| Bumble bee (B. terrestris) | 300 Mb | Draft | 15x | In progress | WGS | 100 |
| California mouse (P. californicus) | 3 Gb | Draft | 2x | In progress | WGS | 100 |
| Centipede (S. maritima) | 300 Mb | Draft | 6x | In progress | WGS | 100 |
| Cotton bollworm (H. armigera) | 500 Mb | Draft | 15x | In progress | WGS | 100 |
| Deer mouse (P. maniculatus) | 3 Gb | Draft | 6x | In progress | WGS | 100 |
| Gibbon, Northern white-cheeked (N. leucogenys) | 3 Gb | Draft | 6x | In progress | WGS | 50 |
| Dolphin, Bottlenose (T. truncatus) | Draft | 2.5x | In progress | WGS | 100 | |
| Dwarf honey bee (A. florea) | 300 Mb | Draft | 15x | In progress | WGS | 100 |
| Hessian fly (M. destructor) | 200 Mb | Draft | 15x | In progress | WGS | 100 |
| Kangaroo rat (D. ordii) | 3 Gb | Draft | 2x | In progress | WGS | 100 |
| Lemur, Gray Mouse (M. Murinus) | 3 Gb | Draft | 6x | In progress | WGS | 66 |
| Oldfield mouse (P. polionotus) | 3 Gb | Draft | 2x | In progress | WGS | 100 |
| Sandfly (L. longipalpis) | 300 Mb | Draft | 15x | In progress | WGS | 100 |
| White-footed mouse (P. leucopus) | 3 Gb | Draft | 2x | In progress | WGS | 100 |
Table: Genomes at the BCM-HGSC, past and present. Finished = Bermuda standard; Draft = deep draft coverage >6x of raw data; Mixed = WGS + BAC clone approach; WGS = whole genome shotgun; CCS = Concatenated cDNA Sequencing; CAPSS = Clone Array Pool Shotgun Sequencing; PGI = Pooled Genome Indexing.
