The Sanger Institute has completed the sequencing of Clostridium difficile strain 630 (epidemic type X), a major nosocomial pathogen world-wide causing disease ranging from antibiotic-associated diarrhoea to life-threatening pseudomembranous colitis. We are working in collaboration with Brendan Wren of the London School of Hygiene and Tropical Medicine, Neil Fairweather and Gordon Dougan of the Centre for Molecular Microbiology and Infection, Imperial College, London and Peter Mullany of the Eastman Dental Institute, London.
.Strain 630 is multi-drug resistant, and was isolated from a patient with severe PMC that had spread to dozens of other patients on the same ward in Zurich, Switzerland in 1982. Thus strain 630 has the genetic attributes of a fully virulent, highly transmissible, drug resistant strain.
The genome consists of a circular chromosome of 4,290,252 bp with a G+C content of 29.06%,
and a circular plasmid of 7,881 bp with a G+C content of 27.9%.
The current gene prediction and annotation (EMBL format) can be downloaded here , and can be searched through the BLAST server.
The sequence and analysis are described in Sebaihia et al. (2006), Nature Genetics 38, 779 - 786 and has been submitted to EMBL/GenBank with the accession number AM180355.
A database containing the sequence data is available by FTP and can be searched using our BLAST server.
We have also generated sequence data from C. difficile strain R20291 (SM) using the 454/Roche GS20 system.
C. difficile strain R20291 was isolated in Stoke Mandeville, UK, and is closely related to the North American hypervirulent BI strains.
The sequence is in 546 contigs with a total size of 4.049 Mb long and may be downloaded from our ftp site.
Please note that this sequence is unchecked and unedited, and will contain errors. The GS20 is known to have problems with homopolymeric tracts, and these are therefore likely to contain significant numbers of errors.
The Sanger Institute has begun shotgun sequencing of seven further strains of C. difficile as follows :
C. difficile strain 196
- Isolated: France, 1985 (Ribotype 027)
- Assembly: 1 contigs >1kb (1 contigs >2kb); total size 4.111 Mb
- Shotgun: 33,509 reads totalling 20.887 Mb, theoretical genome coverage of 99.26%
C. difficile strain BI1
- Isolated: USA, 1988 (Ribotype 027)
- Assembly: 19 contigs >1kb (10 contigs >2kb); total size 4.474 Mb
- Shotgun: 34,428 reads totalling 21.622 Mb, theoretical genome coverage of 99.33%
C. difficile strain M120
- Isolated: UK, 2007 (Ribotype 078)
- Assembly: 35 contigs >1kb (20 contigs >2kb); total size 4.104 Mb
- Shotgun: 32,381 reads totalling 21.754 Mb, theoretical genome coverage of 99.33%
C. difficile strain M68
- Isolated: Ireland, 2006 (Ribotype 017)
- Assembly: 814 contigs >1kb (421 contigs >2kb); total size 5.480 Mb
- Shotgun: 29,958 reads totalling 18.975 Mb, theoretical genome coverage of 98.77%
C. difficile strain SM (R20291)
- Isolated: UK, 2006 (Ribotype 027)
- Assembly: 69 contigs >1kb (49 contigs >2kb); total size 4.230 Mb
- Shotgun: 34,018 reads totalling 22.599 Mb, theoretical genome coverage of 99.45%
C. difficile strain CF5
- Isolated: Belgium, 1995 (Ribotype 017)
- Assembly: 42 contigs >1kb (25 contigs >2kb); total size 4.205 Mb
- Shotgun: 34,226 reads totalling 23.359 Mb, theoretical genome coverage of 99.99%
C. difficile strain 855
- Isolated: USA, 2007 (Ribotype 027)
- Assembly: 31 contigs >1kb (22 contigs >2kb); total size 4.220 Mb
- Shotgun: 33,918 reads totalling 22.907 Mb, theoretical genome coverage of 99.99%



