Human Genetics Informatics (HGI)

Human Genetics

We help the Human Genetics faculty groups evaluate and access the best methods to process and absorb the huge amounts of sequencing data produced by modern studies at Sanger. In practice, this means creating, testing and running variant-calling pipelines, RNASeq pipelines and annotation pipelines on cohorts of tens of thousands of genomes, exomes and transcriptomes.

All this requires superb understanding and control of:

The Sanger’s High Performance Compute architecture (many server farms with thousands of cores)
The Sanger’s OpenStack Flexible Compute Architecture and how to reliably deploy into it
Frameworks to run biological pipelines (e.g. Cromwell, NextFlow)
Frameworks to store, annotate, filter and analyse very large amounts of genomic variant data (e.g. Hail , or more experimental software such as Tachyon)
Tools to help us view and account for our storage and processing

We rely on Sanger’s systems teams, and cooperate extensively with Sanger core teams and other Sanger program informatics teams such as Cancer Informatics and Cell Gen Informatics to share experience and practice.

We aim to deliver data in a reliable way and continously improve how and what we deliver. One thing is clear – we can no longer hand over vcf files and call it a day! Interested? Come talk to us!

Our people

Group lead

Vivek Iyer

Human Genetics Informatics Team Lead

Core team

Mr Sendu Bala

Principle Software Developer

Allan Daly

Senior Bioinformatician

Dr Ruth Eberhardt

Senior Bioinformatician

Dr Matiss Ozols

Principal Bioinformatician

Iaroslav Popov

Senior Bioinformatician

Dr Mark Thomas

Principal Bioinformatician

Previous core team members

Piyush Ahuja

Senior Software Developer

Dr Pavlos Antoniou

Principal Software Developer

Irina Gabriela Colgiu

Senior Software Developer

Mr Daniel Joseph Elia

Sandwich Year Placement / Software Developer

Dr Edgar Garriga Nogales

Senior Bioinformatician

Christopher Harrison

Senior Software Developer

Emyr James

Principal Systems Administrator / Principal DevOps Engineer

Filip Makosza

Sandwich Year Placement

Dr Guillaume Noell

Senior Software Developer

Colin Nolan

Former Group Leader (Acting)

Dr Hannes Ponstingl

Principal Bioinformatician

Dr Joshua C. Randall

Senior Scientific Manager

Associated research

Tools & software

Tool

Hail analysis pipelines

Hail based analysis pipelines for HG projects: pipelines for QC of genome - sequenced cohorts, and GWAS after QC. Designed to ...

Tool

Hgi Cloud

Terraform and ansible codebase to provision clusters (e.g. hail/spark) at Sanger. The framework can be used to provision ...

Tool

Nextflow-RNASeq

A bioinformatics analysis pipeline used for RNA sequencing data, written in the new nextflow DSL2 language syntax, leveraging nextflow modules.

Tool

Shepherd

This code performs targeted archiving of the files arising from the analysis of Sanger sequencing projects.

Tool

Weaver

Browser-based Shiny frontend to view internal Lustre volume reports.

Related groups

Science group

Anderson Group

Genomics of inflammation and immunity

The goal of our research is to use high-throughput screens to gain causal insights into the biological basis of human disease, ...

Science group

Davenport Group

Functional Genomics

We combine genomic analysis with clinical data to understand how genetics contributes to the variation between patients in their disease severity ...

Science group

Human Genetics Administration

Human Genetics

The Human Genetics Administration comprises a five strong team that provides comprehensive support for the smooth running of the Human Genetics ...

Science group

Hurles Group

Genomic mutation and genetic disease

The Hurles group studies the genetic causes and mechanisms of rare genetic disorders and how DNA mutates as it is pass ...

Science group

Informatics Support Group

High Performance Computing

We deliver the at-scale computational platforms that enable the Sanger Institute’s scientists to deliver genomic research that others are unable ...

Science group

Informatics and Digital Solutions

Scientific Computing

We support the Sanger Institute’s mission to deliver innovative and ambitious genomics research at a scale to improve human health ...

Science group

Martin Group

Medical and population genomics

We analyse large-scale genetic and phenotype data to explore the genetic architecture of rare and common diseases and related quantitative traits. ...

Science group

New Pipeline Group (NPG)

Sequencing Informatics

NPG is responsible for the delivery of DNA Pipelines's data products and the provision of informatics expertise and QC systems.

Science group

Parts Group

Understanding human DNA function by engineering

Our goal is to mechanistically understand impact of mutations in human DNA. To do so, we engineer DNA variation in cells, ...

Science group

Soranzo Group

Human Complex Traits

Our research focuses on the application of large-scale genomic analysis to unravel the spectrum of human genetic variation associated with cardiometabolic ...

Wellcome Sanger Institute

Programmes and Facilities

Programme

Human Genetics

The Human Genetics Programme is driving a step-change in our understanding of genetic causes and biological mechanisms of disease susceptibility and ...

Programme

Information Communications Technology

Our goal is "To provide World Class High Performance Computing and First Class Production Platforms and Services for genome and biodata ...

Careers and Study

Policies

Archive

Leadership

Faculty

Human Genetics Informatics (HGI)

Our people

Group lead

Vivek Iyer

Core team

Mr Sendu Bala

Allan Daly

Dr Ruth Eberhardt

Dr Matiss Ozols

Iaroslav Popov

Dr Mark Thomas

Previous core team members

Piyush Ahuja

Dr Pavlos Antoniou

Irina Gabriela Colgiu

Mr Daniel Joseph Elia

Dr Edgar Garriga Nogales

Christopher Harrison

Emyr James

Filip Makosza

Dr Guillaume Noell

Colin Nolan

Dr Hannes Ponstingl

Dr Joshua C. Randall

Associated research

Hail analysis pipelines

Hgi Cloud

Nextflow-RNASeq

Shepherd

Weaver

Related groups

Anderson Group

Davenport Group

Human Genetics Administration

Hurles Group

Informatics Support Group

Informatics and Digital Solutions

Martin Group

New Pipeline Group (NPG)

Parts Group

Soranzo Group

Programmes and Facilities

Human Genetics

Information Communications Technology