Cellular Genetics Informatics
- Comprehensive Bioinformatics support for the programme users.
- Development and support of single-cell Nextflow analysis pipelines for various experimental protocols, such as 10X, Smartseq2, scATACseq, TraCeR/BraCeR, CITEseq, inDrop etc. All of our pipelines have full HPC (LSF) and OpenStack Cloud integrations.
- Data sharing with external collaborators.
- Data submission to ArrayExpress.
- Maintenance and support of the programme’s Jupyter Hub dedicated for the data secondary analysis. We provide software support and a set of standard analysis notebooks, such as Seurat, scanpy, single cell data integration notebooks etc.
- Data access for external collaborators via Jupyter Hub.
- Development of the programme’s internal Web portal which includes a multiuser sample tracker, ability for the users to run our pipelines from the web and integration with our Jupyter Hub.
- Development and support of the programme’s Imaging portal which allows the users to both run the imaging analysis pipelines and visualise their results.
- Development of GPU-accelerated imaging pipelines.
- Deployment of bespoke and the publication supporting websites containing models and concise data visualisations.
- Organisation of workshops for the users on different topics, such as Git, Jupyter Notebooks, Docker/Cloud, Rstudio/Shiny, Nextflow.
Illustrator: Christina Usher
Our Software/Analysis Stack
We run our pipelines and perform analysis on the Sanger’s High Performance Compute cluster (thousands of cores orchestrated by LSF) and the Sanger’s OpenStack Flexible Compute environment (private OpenStack cloud with thousands of cores orchestrated by Kubernetes). We use the following software/analysis stack (more information is in our GitHub organisation):
- Back End: Kubernetes, Docker, Singularity, Terraform, Ansible
- Interactive Analysis Environments: Jupyter Hub
- Pipelines runners: Nextflow
- Secondary Analysis: R, python, C, bash
- Imaging: Omero, Bio-Formats, Fiji, cellprofiler, StarFish
Our team is still growing and there will be new positions available in 2020-2021, please check the Sanger vacancies website for more information.
We have an excellent experience with student internships and apprenticeships. We welcome students with their own funding (with a possibility of topping it up) to work on both infrastructure and research projects.
Informatics Support Group
High Performance Computing
Our Informatics support team is responsible for both developing and providing scale out scientific compute platforms that can both meet todays ...
New Pipeline Group (NPG)
NPG is responsible for the delivery of DNA Pipelines's data products and the provision of informatics expertise and QC systems.
Function of human DNA and its variation
Our goal is to understand how genetic background influences outcome of mutations. To do so, we measure, model, and modulate cell ...
Gene expression genomics
We use cutting edge single cell genomics technologies and computational methods to understand genes, proteins and cells in human health and ...
The Trynka group combines experimental and computational approaches to study how genetics control the immune system and predispose individuals to autoimmune ...
Rodent models of malaria
At the Sanger Institute Oliver Billker's group used experimental genetics in rodent models to study the basic biology of malaria ...
Genomics of gene regulation
Gene expression involves the transformation of genetic information encoded in DNA sequence into a gene product, such as a protein. Regulation ...
Quantitative models of gene expression
The Hemberg group is interested in developing quantitative models of gene expression. Our approach is theoretical and we strive to develop ...