Drosophila Genetic Reference Panel
Users are free to use the data in scientific papers analyzing particular
genes and regions if the providers of this data are properly acknowledged.
Please cite the BCM-HGSC web site or publications from BCM-HGSC referring
to the genome sequence. BCM HGSC plans to publish the assembly and genomic
annotation of the dataset, including genome wide molecular population
genetic analyses, large-scale identification of regions of evolutionary
conservation and quantitative trait association studies. This is in
accordance with, and with the understandings in the Fort Lauderdale meeting
discussing Community Resource Projects and the resulting NHGRI policy
statement. (www.genome.gov).
Note: all *.txt.gz files in the Illumina/ subdirectories are in FASTQ
format, though they contain "raw" Illumina-encoded quality values.
Unfortunately, Illumina changed the manner in which they encode
quality values in an update to their software. As such, all files
with datestamps of 090223 (Feb 23, 2009) and newer were created by
GAPipeline version 1.3, with the "newer" method of quality encoding.
All other files with earlier datestamps are from the GAPipeline
version 1.1 quality encoding. For more information, see:
http://en.wikipedia.org/wiki/FASTQ_format
However, the accompanying .fastq.gz files are in standardized,
Phred-scaled format. Both are provided for your convenience.
Apache/2.2 Server at www.hgsc.bcm.tmc.edu Port 80