Contact WTSI Webmaster Printer friendly format Login to WTSI resources WTSI RSS feed
All Sequencing
  • Human (HGP)
  • Pathogens
  • Blast
  • C. elegans
  • Overview
  • Sequence data
  • BLAST search
  • Wormpep
  • FTP site
  • C. briggsae
  • C. briggsae project
  • BLAST Search
  • WormBase
  • Release info
  • Current gene names
  • Submit data
  • GFF files
  • Documentation
  • Annotation
  • Website

  • Ensembl
  • C. elegans project
  • Website Search
  • People Search
  • Library Services
  • Site Map
  • Feedback / Help
Retrieve BLAST result
More Wormpep Information

Each Wormpep release comes with the following additional files containing useful information:

Wormpep.table contains all the information from the Wormpep release, but without the protein sequences, in a format easy to parse. Every line contains a CDS identifier, and the following tabulator-separated entries: Wormpep accession number; locus; brief identification; EST/mRNA evidence for underlying CDS (either 'Confirmed' where there is complete EST/mRNA coverage, otherwise 'Predicted'); TREMBL or Swiss-Prot accession number; protein_id.

Wormpep.accession contains a list of all the Wormpep accession numbers ever assigned. Only if an accession number is currently in use, it will be followed by the CDS identifier(s) it is associated with. Some sequences have been assigned several different accession numbers in the past. If this is the case, the duplicated number will be followed by the accession number now used in Wormpep.

Wormpep.history has been built based on all the Wormpep releases starting with Wormpep8. Every CDS identifier is associated with an accession number, and a start and an end date in the form of a Wormpep version number. Every time a CDS prediction changes, a new entry is created for that CDS, with a start but without an end date. At the same time, the old entry gets assigned an end date. Searching the history.table file with a CDS identifier as query will show with which accession numbers and sequences this particular CDS has been associated with throughout the different Wormpep releases.

Wormpep.diff lists all the changes that have been introduced by the new wormpep release. You can find the CDS identifiers that have disappeared (lost), the CDS identifiers that have been added for the first time (new), the CDS identifiers of sequences thta have been modified (changed), and the CDS identifiers that have existed before but were absent in recent Wormpep releases (reappeared).

wp.fastaXXX contains all of the CDS predictions ever made by the sequencing consortium. This allows researchers to retrieve old (history) versions of the predictions.



Information Projects Other Services
Sanger Home
Sitemap
Site Search
Information
Careers
Press
News
Seminars
Workshops
Publications
Staff Theses
Travel Directions
Research Teams
Research Faculty
Personnel Search
Human Genetics
Model Organism Genetics
Pathogen Genetics
Bioinformatics
Sequencing
Library
Helpdesk
Webmail
VPN Access
Sign In
SSO Pass. Reset

webmaster@sanger.ac.uk

Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK  Tel:+44 (0)1223 834244

Last Modified Mon Mar 1 13:54:34 2010

Genome Research Limited is a charity registered in England with number 1021457

Data Sharing Policy | Conditions of Use | Copyright