Data

Human

1000 Genomes

The 1000 Genomes Project developed a new map of the human genome at a resolution that was unmatched by other ...

Bacterial

Bacterial Data

This Archived page provides historical information on the genome sequences of bacteria that were sequenced at the Wellcome Sanger Institute and ...

Bacteriophage

Bacteriophage Data

All bacteriophage genomes sequenced by the Sanger Institute

Human

Cancer Gene Census

The Cancer Gene Census is a high-confidence list of genes with substantial published evidence in oncology.

Bacterial

Culture collection of human gut bacteria

Host-Microbiota Interactions Laboratory

Bacterial

Culture collection of reference Clostridium difficile isolates

Host-Microbiota Interactions Laboratory

Disease Vector

All the Disease Vectors sequenced by the Sanger Institute

Gorilla

Ensembl - Gorilla

The Gorilla sequencing project and how to access the publicly available draft assembly through Ensembl

Human

Ensembl - Human

Access to the reference human genome sequence

Mouse

Ensembl - Mouse

Access to the reference mouse genome sequence, other mouse genome sequences and to individual mouse chromosomes

Zebrafish'

Ensembl - Zebrafish

Access to the zebrafish genome sequence in Ensembl

Human

ExoSeq

The Institute sequenced 9.6 million PCR products from 50 individuals between 2003 and 2007 to understand variation in human protein ...

Bacterial

Female bladder microbiota bacterial genomes

Host-Microbiota Interactions Laboratory

Fungal

All fungal genomes sequenced by the Sanger Institute

Other Vertebrate

Genome Notes - Darwin Tree of Life

Genome Notes are the DNA sequences of the reference genomes of the 70,000 UK species of Britain and Ireland as ...

Human

Genome Reference Consortium

The GRC aims to ensure that the human, mouse and zebrafish reference assemblies are biologically relevant by closing gaps, fixing ...

Bacterial

Genomes of cultured human gut bacteria

Host-Microbiota Interactions Laboratory

Gorilla

Gorilla genome - data download

Gorillas, the largest living primates, are humans' closest living relatives after chimpanzees, and are important for the study of human ...

Bacteriophage

Gut phage database

Host-Microbiota Interactions Laboratory

Human

HapMap 3

Access data from the third phase of the International HapMap Project.

Helminth

Helminth Data

All helminth genomes sequenced by the Sanger Institute

Bacterial

Metagenome assembled genomes from the human intestinal microbiota

Host-Microbiota Interactions Laboratory

Bacterial

Metagenomes and whole genome seqeunces from the intestinal microbiota of babies and mothers

Host-Microbiota Interactions Laboratory

Bacterial

Mouse Gastrointestinal Bacteria Catalogue

Host-Microbiota Interactions Laboratory

Mouse

Mouse Genomes Project

The Mouse Genomes Project is an ongoing effort to catalog all forms of genetic variation between the common laboratory mouse strains ...

Mouse

Mouse Phenotype FTP data

Sanger Institute Mouse Phenotyping data

Mouse

Olfactory Transcriptomes of Mice

A catalogue of olfactory transcriptomes and extended chemosensory receptor gene annotations

Other Vertebrate

Pig Genome

Access the map, clone and genome resources from the Porcine Genome Sequencing Project

Plasmid

Plasmid Data

All plasmid data sequenced by the Sanger Institute

Proteomics

Proteomics Datasets

The Proteomic Mass Spectrometry group deposits all their published data into ProteomeXchange.

Proteomics

Proteomics TrackHubs

The Proteomic Mass Spectrometry group releases all their proteogenomics data as track hubs accessible via ftp and http.

Protozoan

Protozoan Data

All protozoan genomes sequenced by the Sanger Institute

Other Vertebrate

Pufferfish

Genome sequence information for the pufferfish can be accessed through our FTP site.

Bacterial

Reference genomes for 906 Clostridium difficile isolates

Host-Microbiota Interactions Laboratory

Bacterial

Reference genomes of Clostridium difficile

We have sequenced and annotated 15 high quality reference genomes of Clostridium difficile.

Other Vertebrate

Vertebrate Genomes Sequencing

The Sanger Institute is developing a major programme in biological diversity genome sequencing across the tree of life. One of the ...

Virus

Virus Data

All viral genomes sequenced by the Sanger Institute

Helminth

Worm Genome

The Institute collaborated in the sequencing of the C. elegans and C. brigssae genomes

Yeast

Yeast Data

SGRP, the Saccharomyces Genome Resequencing Project

Zebrafish

Zebrafish Genome Project

Danio rerio reference genome assemblies and assemblies of additional D. rerio strains and Danio and Danionella species.

Careers and Study

Policies

Archive

Leadership

Faculty

Data directory

Archived

Type

Search

1000 Genomes

Bacterial Data

Bacteriophage Data

Cancer Gene Census

Culture collection of human gut bacteria

Culture collection of reference Clostridium difficile isolates

Disease Vector

Ensembl - Gorilla

Ensembl - Human

Ensembl - Mouse

Ensembl - Zebrafish

ExoSeq

Female bladder microbiota bacterial genomes

Fungal

Genome Notes - Darwin Tree of Life

Genome Reference Consortium

Genomes of cultured human gut bacteria

Gorilla genome - data download

Gut phage database

HapMap 3

Helminth Data

Metagenome assembled genomes from the human intestinal microbiota

Metagenomes and whole genome seqeunces from the intestinal microbiota of babies and mothers

Mouse Gastrointestinal Bacteria Catalogue

Mouse Genomes Project

Mouse Phenotype FTP data

Olfactory Transcriptomes of Mice

Pig Genome

Plasmid Data

Proteomics Datasets

Proteomics TrackHubs

Protozoan Data

Pufferfish

Reference genomes for 906 Clostridium difficile isolates

Reference genomes of Clostridium difficile

Vertebrate Genomes Sequencing

Virus Data

Worm Genome

Yeast Data

Zebrafish Genome Project