Genome Reference Informatics Team
The team consists of Senior Bioinformaticians and Senior Computer Biologists to cover software development, data analysis and genome curation. Besides comissioning sequencing and generation of genome data (e.g. optical mapping) and assembly generation, we develop bespoke software like gEVAL to identify and visualize genome assembly issues. This is used by our curators to resolve those issues through changes or additions to the assemblies, thereby vastly improving assembly accuracy.
We are evaluating and improving assemblies as members of the Darwin Tree of Life Project (see also here), the Vertebrate Genome Project, the Human Pangenome Project, the Genome Reference Consortium and others. We work closely with our consortium partners and other collaborators to ensure access to the latest data and analyses. Assembly improvements are submitted to INSDC on a regular schedule.
Genome Reference Consortium
The Genome Reference Consortium aims to ensure that the human, mouse and zebrafish reference assemblies are biologically relevant by closing gaps, ...
Mouse Genomes Project
We are collaborating with the Mouse Genomes Project to improve the individual strain assemblies.
Vertebrate Genomes Sequencing
The Sanger Institute is developing a major programme in biological diversity genome sequencing across the tree of life. One of ...
All life is linked by the common thread of DNA, modified through evolution. We use whole genome sequences to explore ...
High Performance Algorithm Group (HPAG)
Collaborate on the generation of zebrafish assemblies.
We work with the following groups
The GRC aims to ensure that the human, mouse and zebrafish reference assemblies are biologically relevant by closing gaps, fixing errors and representing complex variation.
The Vertebrate Genomes Project (VGP), a project of the G10K Consortium, aims to generate near error-free reference genome assemblies of all 66,000 extant vertebrate species.