Haemophilus

Haemophilus is a genus of Gram-negative, pleomorphic, coccobacilli bacteria belonging to the Pasteurellaceae family, including common commensals of the human upper respiratory tract, some of which are associated with human disease. H. influenzae, and rarely H. parainfluenzae, cause numerous mild and serious infections, including bacteremia, meningitis, pneumonia and otitis media.

Data Downloads

[Genome Research Limited]

The Sanger Institute has been funded by the Wellcome Trust to sequence the genomes of

Haemophilus influenzae serotype B strain 10810

Haemophilus influenzae biogroup Aegypticus BPF strain F3031

Haemophilus influenzae biogroup Aegypticus conjunctivitis strain F3047

Haemophilus parainfluenzae strain T3T1

Collaborators

Dr. Derrick Crook of the University of Oxford Nuffield Department of Clinical Laboratory Sciences, John Radcliffe Hospital, Oxford

Dr. Derek Hood and Prof. Richard Moxon of the University of Oxford Department of Paediatrics, John Radcliffe Hospital, Oxford

Professor Simon Kroll of the Department of Paediatric Infectious Diseases at Imperial College, London

Also in collaboration with Dr. Derek Hood and Dr. Peter Power of the University of Oxford Department of Paediatrics, John Radcliffe Hospital, Oxford we have initiated a Wellcome Trust funded project to study population structure in the genus Haemophilus and mechanisms of recombination in Haemophilus influenzae and Neisseria meningitidis.

Published Genome Data

We have completed the details of several strains of Haemophilus, the details of which are given below.

Finished Sequence

Haemophilus influenzae

The genome of Haemophilus influenzae is 1,981,535 and has a G+C content of 38.14%. The data is available for searching on our BLAST server or for download from our FTP site.

Haemophilus influenzae strain F303

We have completed the genome of Haemophilus influenzae biogroup Aegypticus BPF strain F3031. Biogroup Aegypticus is normally associated with purulent conjuctivitis. Strain F3031 is a representative of a clone associated with the Brazilian purpuric fever outbreak in the early 1980s. The genome consists of a single chromosome of 1,985,824 bp with a G+C content of 38.22% and a plasmid of 32,433 bp.

Haemophilus parainfluenzae

Haemophilus parainfluenzae is a Gram negative commensal of the human respiratory tract. Strain T3T1 was obtained from the throat of a healthy child, and has been confirmed by 16S rRNA sequence. The genome is now finished and is 2,086,875 bp in size with a G+C content of 39.57%.

Shotgun and assembly data from these projects are available from our FTP site.

Ongoing Projects

Haemophilus influenzae strain F3047

Haemophilus influenzae is a Gram negative commensal of the human respiratory tract that can cause meningitis and other diseases. Strain F3047 is a representative of biogroup Aegypticus which is normally associated with purulent conjuctivitis.

At present there are 31,070 reads totalling 16.797 Mb and giving a theoretical coverage of 99.95% of the genome. There are 7 contigs > 1kb (3 contigs >2kb) with a total size of 2.207 Mb. Finishing of this genome is in progress.

Discovery of sequence diversity in Haemophilus influenzae

Sample

Strain

Run Accession

Fi1247Fi1247 ERR019339
Fi432Fi432 ERR019343
Fi285Fi285 ERR019344
Fi667Fi667 ERR019345
Fi1180Fi1180 ERR019346
Fi176Fi176 ERR019340
Fi162Fi162 ERR019348
jch787jch787 ERR019349
Fi1200Fi1200 ERR019350
Fi981Fi981 ERR019347
Fi1124Fi1124 ERR019341
Fi723Fi723 ERR019342
RM7190RM7190 ERR019352
eaganeagan ERR019356
PL h.hl nctc 10659(T)PL h.hl nctc 10659(T) ERR019357
RM6062RM6062 ERR019358
RM6064RM6064 ERR019359
RM6073RM6073 ERR019360
RM6132RM6132 ERR019361
RM6134RM6134 ERR019362
RM6158RM6158 ERR019363
RM6237RM6237 ERR019353
RM7017RM7017 ERR019354
RM7066RM7066 ERR019355
RM7283RM7283 ERR028079
RM7290RM7290 ERR028082
RM7414RM7414 ERR028083
RM7419RM7419 ERR028084
RM7651RM7651 ERR028085
RM7422RM7422 ERR028086
PL MIOG 2836PL MIOG 2836 ERR028087
PL MIOG 2819PL MIOG 2819 ERR028088
PL MIOG 2820PL MIOG 2820 ERR028089
PL MIOG 2822 H/LPL MIOG 2822 H/L ERR028080
PL MIOG 2837PL MIOG 2837 ERR028081
DC T2T1 ST-34DC T2T1 ST-34 ERR019365
DC T5A1 ST-41DC T5A1 ST-41 ERR019369
DC T7B2 ST-47DC T7B2 ST-47 ERR019370
DC T8A1 ST-52DC T8A1 ST-52 ERR019371
PL MIOG 2838 H/LPL MIOG 2838 H/L ERR019372
PL 10839TPL 10839T ERR019373
DC O/OM30/1/A1DC O/OM30/1/A1 ERR019374
DC O/OM33/2/B3 ST-21DC O/OM33/2/B3 ST-21 ERR019375
DC G/T53T1DC G/T53T1 ERR019376
DC T2B3 ST-33DC T2B3 ST-33 ERR019366
DC T8B3 ST-51DC T8B3 ST-51 ERR019367
DC MO/009/14/S/TR ST-12DC MO/009/14/S/TR ST-12 ERR019368
DH 398 spainDH 398 spain ERR019378
RY 8RY 8 ERR019382
DH 1513 spainDH 1513 spain ERR019383
DH 1559 spainDH 1559 spain ERR019384
DH 1630 spainDH 1630 spain ERR019385
DC O/CFE24/1/T2 ST-27DC O/CFE24/1/T2 ST-27 ERR019386
DC MO/099/5/L ST-8DC MO/099/5/L ST-8 ERR019387
DC MO/099/8/M ST-8DC MO/099/8/M ST-8 ERR019388
RY 20RY 20 ERR019389
RY 22RY 22 ERR019379
RY 15RY 15 ERR019380
DH 1500 spainDH 1500 spain ERR019381
PPHPSPPHPS ERR028091
RM6006RM6006 ERR028094
RM6011RM6011 ERR028095
RM6051RM6051 ERR028096
RM6019RM6019 ERR028097
RM7018RM7018 ERR028098
RM6033RM6033 ERR028099
RM7028RM7028 ERR028100
RM7054RM7054 ERR028092
RM7068RM7068 ERR028093
RM7459RM7459 ERR028102
RM7465RM7465 ERR028105
RM7477RM7477 ERR028106
RM7490RM7490 ERR028107
RM7617RM7617 ERR028108
RM7637RM7637 ERR028109
RM7876RM7876 ERR028110
RM7122RM7122 ERR028111
RM7308RM7308 ERR028112
RM7309RM7309 ERR028103
RM7448RM7448 ERR028104
RM7582RM7582 ERR019391
RM7598RM7598 ERR019395
RM7578RM7578 ERR019396
DC G1574DC G1574 ERR019397
DC H3151DC H3151 ERR019398
DC 800DC 800 ERR019399
DC 7331DC 7331 ERR019400
DC 7654DC 7654 ERR019401
DC 7695DC 7695 ERR019402
DC 8708DC 8708 ERR019392
DC g2120DC g2120 ERR019393
DC 11238DC 11238 ERR019394
Rd-eaganRd-eagan ERR019403
Rd-RH154Rd-RH154 ERR019404

Data Use Statement

This sequencing centre plans on publishing the completed and annotated sequences in a peer-reviewed journal as soon as possible. Permission of the principal investigator should be obtained before publishing analyses of the sequence/open reading frames/genes on a chromosome or genome scale. See our data sharing policy.

Sequencing enquiries

Please address all sequencing enquiries to: pathinfo@sanger.ac.uk

* quick link - http://q.sanger.ac.uk/6jxwl9tf