Development and support of single-cell analysis pipelines for various experimental protocols, such as 10X, Smartseq2, scATACseq, CITEseq, inDrop etc.
Maintenance and support of the programme's Jupyter Hub dedicated for the data secondary analysis. We provide software support and a set of standard analysis notebooks, such as Seurat, scanpy, single cell data integration notebooks etc.
Development of the programme's internal Web portal which includes a multiuser sample tracker, ability for the users to run our pipelines from the web and integration with our Jupyter Hub.
Development and support of the programme's Imaging portal which allows the users to both run the imaging analysis pipelines and visualise their results.
Deployment of the publication supporting websites containing concise data visualisations.
Our Development Stack
Our development stack is based on the Sanger’s High Performance Compute architecture (thousands of cores orchestrated by LSF) and the Sanger’s OpenStack Flexible Compute Architecture (private OpenStack cloud with thousands of cores orchestrated by Kubernetes) and consists of the following items (more information is in our GitHub organisation):
Back End: Docker, Singularity, Kubernetes, Terraform, Ansible
We have an excellent experience with student internships and apprenticeships. We welcome students with their own funding (with a possibility of topping it up) to work on both infrastructure and research projects. Please contact Vladimir Kiselev for further information. For apprenticeship information please check the dedicated Sanger website.
We explore the consequences of genome variation on human cell biology, and thus gene function in health and disease. We conduct large-scale systematic screens to discover the impact of naturally-occurring and engineered genome mutations in human iPS cells, their differentiated derivatives, and other cell types.
Partners and Funders
Within Sanger, we work closely with the following groups:
Gene expression involves the transformation of genetic information encoded in DNA sequence into a gene product, such as a protein. Regulation of gene expression is a fundamentally important process in biology because controlling the timing, location and level of gene expression is critical for the gene product to function correctly. The majority of mutations that alter disease risk for most common diseases are thought affect gene regulation, although how these mutations actually function is not well understood in most cases. Our group uses a combination of statistical and experimental approaches to map mutations that affect gene regulation in humans.
The Hemberg group is interested in developing quantitative models of gene expression. Our approach is theoretical and we strive to develop novel mathematical models as well as computational tools that can be used by other researchers.
Our Informatics support team is responsible for both developing and providing scale out scientific compute platforms that can both meet todays scientific challenges but also provide support and development as new and potentially disruptive technologies arise. Today our team develops and maintains:
A Sanger private cloud environment (Flexible Compute Environment). High Performance Compute (HPC) platforms for our bioinformatics teams. Provide Lustre and iRODS storage systems to support Sanger's HPC environmentDeveloped a noval performant secure lustre service with multi-tenant supportBringing in new and developing IT technologies. Developing new flexible compute platforms with software defined networking and IaaS. Provide tier 3 support and consultation for our bio-informatics staff and their collaborators.
We measure, model, and modulate cell state. We use genome engineering and synthetic biology to create cell lines that can be employed for CRISPR/Cas9-based genetic screening and high throughput cell biology assays. We develop probabilistic models as well as software tools to accurately analyse the readouts.
The group seeks to elucidate the principles of protein structure evolution, higher order protein structure and protein folding, and the principles underlying protein complex formation and organization. We have a longstanding interest in understanding gene expression regulation, and in our wetlab at the Sanger Institute use mouse T helper cells as a model of cell differentiation.