Introduction
This page includes supplementary data for the Nucleic Acids Research paper, Hajarnavis, Korf and Durbin, A Probabilistic Model of 3' End Formation in Caenorhabditis elegans.
All data refers to WormBase Freeze WS110.
Confirmed sets relate to our training sets. Predicted sets are the results of our genome-wide scan.
DNA files
These are DNA dumps made from the chromosomal coordinates below.
Headers contain the gene name, the position of the AATAAA motif, and that of the cleavage site, which is also the length of the sequence.
Headers contain the gene name, position of AATAAA motif, posterior probability of the AATAA motif given the model, and (if there is EST evidence for that gene), the UTR length as determined by the 3' ESTs - UTR length as determined by a cleavage site prediction.
Chromosomal coordinates
This is the data submitted as supplementary material. Read about it here.
Software
Polyadenylation signal prediction software in Java and in C.
The Java software is a version of PAjHMMA where a C. elegans 3' end model has been hard coded.
