Each Wormpep release comes with the following additional files containing useful information:
Wormpep.table contains all the information from the Wormpep release, but without the protein sequences, in a format easy to parse. Every line contains a CDS identifier, and the following tabulator-separated entries: Wormpep accession number; locus; brief identification; EST/mRNA evidence for underlying CDS (either 'Confirmed' where there is complete EST/mRNA coverage, otherwise 'Predicted'); TREMBL or Swiss-Prot accession number; protein_id.
Wormpep.accession contains a list of all the Wormpep accession numbers ever assigned. Only if an accession number is currently in use, it will be followed by the CDS identifier(s) it is associated with. Some sequences have been assigned several different accession numbers in the past. If this is the case, the duplicated number will be followed by the accession number now used in Wormpep.
Wormpep.history has been built based on all the Wormpep releases starting with Wormpep8. Every CDS identifier is associated with an accession number, and a start and an end date in the form of a Wormpep version number. Every time a CDS prediction changes, a new entry is created for that CDS, with a start but without an end date. At the same time, the old entry gets assigned an end date. Searching the history.table file with a CDS identifier as query will show with which accession numbers and sequences this particular CDS has been associated with throughout the different Wormpep releases.
Wormpep.diff lists all the changes that have been introduced by the new wormpep release. You can find the CDS identifiers that have disappeared (lost), the CDS identifiers that have been added for the first time (new), the CDS identifiers of sequences thta have been modified (changed), and the CDS identifiers that have existed before but were absent in recent Wormpep releases (reappeared).
wp.fastaXXX contains all of the CDS predictions ever made by the sequencing consortium. This allows researchers to retrieve old (history) versions of the predictions.
Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK Tel:+44 (0)1223 834244
Last Modified Mon Mar 1 13:54:34 2010
Genome Research Limited is a charity registered in England with number 1021457



