The purpose of a B. saltans genome sequence is to provide an 'outgroup' for comparative genomic studies. As it is among the closest bodonid relatives of the trypanosomatids, it will provide a model of the ancestral trypanosomatid to distinguish those derived parts of the parasite genomes (i.e., unique trypanosomatid adaptations) from those which are a legacy of the free-living ancestor. This objective can be resolved into three principal comparative issues:
1. Trypanosomatid disease; understanding how human trypanosomatid parasites acquired their distinct pathological strategies;
2. Evolution of parasitism; understanding how the ancestral trypanosomatid became parasitic in terms of derived innovations (e.g., cell surfaces) and loss of genomic repertoire;
3. Eukaryotic evolution; understanding how typical kinetoplastid features (e.g., glycosomes) evolved and how these might have been modified for parasitism.
Published Genome Data
A pilot study that described ~400kbp of B. saltans genome sequenced from a whole genome fosmid library has been published (Jackson, Quail & Berriman 2008, BMC Genomics, 9:594). A draft sequence for the entire B. saltans genome has now been produced using Illumina HiSeq technology. The data were derived from two libraries of 300bp and 3kbp insert sizes respectively. Sequence assembly was carried out using Velvet and subsequently corrected for misassembly using large-insert reads and custom scripts.
The draft genome sequence contains 39,864,435bp at ~110x coverage, arranged into 2,256 scaffolds. The average scaffold length is 16,804.54bp and the largest scaffold is 190,927bp long. N50 = 31,548bp (n=372).
The draft genome annotation contains 18,963 coding sequences. When analyzed using CEGMA, this covers 78% of the conserved sequence set.
In addition, we have produced a draft transcriptome using RNAseq technology. 85.5% of reads mapped to the genome assembly. The RNAseq reads were assembled using Velvet, producing 8,901 putative transcripts and transcript fragments.
All data can be downloaded via our ftp site: ftp://ftp.sanger.ac.uk/pub/pathogens/Bodo/saltans
and searched using the GeneDB BLAST server: http://www.genedb.org/blast/submitblast/GeneDB_Bsaltans
Bodo saltans RNAseq
Sample | Strain | Run Accession |
|---|---|---|
| gRNA1 | Lake Konstanz | ERR152949 |
| gRNA2 | Lake Konstanz | ERR152950 |
| gRNA3 | Lake Konstanz | ERR152951 |
| gRNA4 | Lake Konstanz | ERR152952 |
Bodo saltans genome sequencing project
Sample | Strain | Run Accession |
|---|---|---|
| Bodo gDNA 1 | Lake Konstanz | ERR036176 |
| WGAb9 | Lake Konstanz | ERR036177 |
| WGAb357 | Lake Konstanz | ERR036178 |
| Fosmid196 | Lake Konstanz | ERR037084 |
| BodoDNA1 | Lake Konstanz | ERR038657 |
| BodoDNA1 | Lake Konstanz | ERR037085 |
| BodoDNA2 | Lake Konstanz | ERR037086 |
| BodoDNA3 | Lake Konstanz | ERR037087 |
| Bodo_gDNA_3kb | Lake Konstanz | ERR103046 |
| Bodo_gDNA_5kb | Lake Konstanz | ERR103045 |
| BET3 | Lake Konstanz | ERR144643 |
| BET3 | Lake Konstanz | ERR302449 |
| BET2 | Lake Konstanz | ERR144642 |
| BET2 | Lake Konstanz | ERR302448 |
| BET1 | Lake Konstanz | ERR144641 |
| BET1 | Lake Konstanz | ERR302447 |
| BE3 | Lake Konstanz | ERR144640 |
| BE3 | Lake Konstanz | ERR302446 |
| BE2 | Lake Konstanz | ERR144639 |
| BE2 | Lake Konstanz | ERR302445 |
| BE1 | Lake Konstanz | ERR144638 |
| BE1 | Lake Konstanz | ERR302444 |
| B1_ctrl | Lake Konstanz | ERR144637 |
| B1_ctrl | Lake Konstanz | ERR302443 |


