Human Genetics Informatics (HGI)
All this requires superb understanding and control of:
- The Sanger’s High Performance Compute architecture (many server farms with thousands of cores)
- The Sanger’s OpenStack Flexible Compute Architecture and how to reliably deploy into it
- Frameworks to run biological pipelines (e.g. Cromwell, NextFlow)
- Frameworks to store, annotate, filter and analyse very large amounts of genomic variant data (e.g. Hail , or more experimental software such as Tachyon)
- Tools to help us view and account for our storage and processing
We rely on Sanger’s systems teams, and cooperate extensively with Sanger core teams and other Sanger program informatics teams such as Cancer Informatics and Cell Gen Informatics to share experience and practice.
We aim to deliver data in a reliable way and continously improve how and what we deliver. One thing is clear – we can no longer hand over vcf files and call it a day! Interested? Come talk to us!
Previous team members
Hail analysis pipelines
Hail based analysis pipelines for HG projects: pipelines for QC of genome - sequenced cohorts, and GWAS after QC. Designed to ...
Terraform and ansible codebase to provision clusters (e.g. hail/spark) at Sanger. The framework can be used to provision ...
A bioinformatics analysis pipeline used for RNA sequencing data, written in the new nextflow DSL2 language syntax, leveraging nextflow modules.
This code performs targeted archiving of the files arising from the analysis of Sanger sequencing projects.
Genomics of inflammation and immunity
The goal of our research is to use high-throughput screens to gain causal insights into the biological basis of human disease, ...
We combine genomic analysis with clinical data to understand how genetics contributes to the variation between patients in their disease severity ...
Human Genetics Administration
The Human Genetics Administration comprises a five strong team that provides comprehensive support for the smooth running of the Human Genetics ...
Genomic mutation and genetic disease
The Hurles group studies the genetic causes and mechanisms of rare genetic disorders and how DNA mutates as it is pass ...
Informatics Support Group
High Performance Computing
We deliver the at-scale computational platforms that enable the Sanger Institute’s scientists to deliver genomic research that others are unable ...
Informatics and Digital Solutions
We support the Sanger Institute’s mission to deliver innovative and ambitious genomics research at a scale to improve human health ...
Medical and population genomics
We analyse large-scale genetic and electronic health record data to explore fine-scale population structure, its impact on disease risk, and the ...
New Pipeline Group (NPG)
NPG is responsible for the delivery of DNA Pipelines's data products and the provision of informatics expertise and QC systems.
Understanding human DNA function by engineering
Our goal is to mechanistically understand impact of mutations in human DNA. To do so, we engineer DNA variation in cells, ...