The FTP site contains datasets for the main subsets of data:
- BACs (sequences of the BACs sequenced at by the Sanger Institute, these inlcude the three bloodstream expression sites)
- Contig sequences (contig sequences and of chromosomes I, IX and X), further divided into:
- DNA sequence
- predicted peptides
- GSS sequences
- P1s (sequences of the P1's mapped onto chromosome I)
The data in these directories includes both finished sequences
already submitted to nucleotide sequence databases as well as
preliminary unfinished sequences. Release of data is made as and when
appropriate.
 |
 |
 |
 |
 |
 |
 |
Please note:
- Within each ftp directory, there will usually be several files.
These are generated periodically, so make sure that you obtain the
latest version. The file name contains the date:
tryp1_reads_000203.gz
is a gzip compressed file, made 00-02-03 (3rd Feb 2000) of chromosome 1 sequence reads for example.
- As the number of sequence reads increases, fewer, but larger,
contigs will be assembled. Sequence contig numbers will not stay
the same between successive assemblies
- It is easiest to use your Netscape browser to access these
data. You can download data by clicking on the relevant file whilst
holding down the Shift-key or follow the relevant link, allow the
browser to complete loading the file and then save (Alt-S in
Netscape).
|
 |
 |
 |