We have played lead or key roles in the data processing and analysis of large scale sequencing projects such as 1000 Genomes, Mouse Genomes Project, UK10K, HipSci, and Haplotype Reference Consortium among others.
Recently, in collaboration with the Durbin and GRIT groups at the Sanger Institute, along with a number of external partners, we have joined the Vertebrate Genomes Project and Genome 10K to begin producing genome assemblies for hundreds to thousands of species, using cutting edge long-read sequencing technologies like PacBio, Oxford Nanopore and 10x alongside Illumina.
We develop tools and software to manage our data management and analysis needs at scale.
We develop pipelines and pipeline management systems to track and process our data. The 1000 Genomes and UK10K projects were made possible using the VRPipe and vr-runner systems. With the Sanger Institute recently moving to a cloud oriented compute infrastructure we are developing a new workflow runner (wr) system.
Haplotype Reference Consortium
The Haplotype Reference Consortium (HRC) is a collaboration to create a large reference panel of human haplotypes by combining together ...
Experimental Cancer Genetics
We are a team of cancer biologists, geneticists and computational biologists interested in understanding how cancers develop and the ways of ...
Population and evolutionary genomics, novel computational genomics methods, and related mathematical and statistical models.
Genome Reference Informatics Team
The Genome Reference Informatics Team analyses genome assemblies to reveal and correct quality issues and to identify and add variation. It ...
New Pipeline Group (NPG)
DNA Pipelines Informatics
NPG is responsible for DNA Pipelines's production informatics analysis pipelines, Illumina sequencing QC tools and expertise, and internal archiving ...
Sequence Analysis and Management (SAM)
SAM contributes to various software packages for processing DNA sequence data, including samtools, htslib, biobambam and the Staden package. We also ...