Manual Annotation

Careers | Find us | Contact us

About
- Careers and Study
  Careers and Study
  Working at the Sanger Institute is truly unique. We put collaboration, innovation and support for people as individuals at the centre of everything we do. Join us to help shape the future by delivering life-changing science with the reach, scale, and creativity to solve some of humanity’s greatest challenges.
  About Us
  We tackle some of the most difficult challenges in genomic research. This demands science at scale; a visionary and creative approach to research that pushes the boundaries of our understanding in ever new and exciting ways.
  Read more
  - Careers
    Join our community of world class thinkers and professionals at the Sanger Institute located in Cambridge. Together we achieve life-changing science.
  - Study
    We are committed to training the next generation of pioneering genome scientists and clinicians. At the Wellome Sanger Institute we give PhD students and postdocs all the tools they need to succeed in the field of genomics research.
- Who we are
  Who we are
  Our vision and mission is to deliver world-leading genomics research in collaboration with research partners across the globe. Discover how our funding gives our leadership the independence to conduct bold, ambitious science that pioneers new fields in health, disease and conservation.
  About Us
  We tackle some of the most difficult challenges in genomic research. This demands science at scale; a visionary and creative approach to research that pushes the boundaries of our understanding in ever new and exciting ways.
  Read more
  - Our Vision and Strategy
    By focusing on fundamental discovery research led by our faculty and employing our unique scale in cutting-edge data generation and analysis, we deliver discoveries not easily made elsewhere.
  - Impact
    From providing fundamental resources for understanding biology to exploring cancer genomes and the effects of variation in human genomes, our work lays the foundations for personalised medicine. We also reveal the secrets of human development and how infectious diseases evolve and spread.
  - Leadership and Governance
    Discover how our leadership and structures are designed to enable holistic and effective decision making, with transparency and accountability woven into their make up.
  - Funding
    Since 1992, Wellcome has invested in the Sanger Institute to deliver bold, long-term discovery science at scale, by developing and deploying high-calibre people and technologies.
- Equity, Diversity and Inclusion
  Equity, Diversity and Inclusion
  The diversity in skills and knowledge that we all bring make our Institute the thriving ideas factory that it is. Discover how we support each other to reach our full potential and thrive. We celebrate diversity and seek to ensure that everyone has equal access to professional and career development opportunities.
  About Us
  We tackle some of the most difficult challenges in genomic research. This demands science at scale; a visionary and creative approach to research that pushes the boundaries of our understanding in ever new and exciting ways.
  Read more
- Policies
  Policies
  We play a pivotal role in helping to shape Government and International research policies. We also lead the way in developing guidance to support our scientists to carry out their research ethically, equitably and responsibly.
  About Us
  We tackle some of the most difficult challenges in genomic research. This demands science at scale; a visionary and creative approach to research that pushes the boundaries of our understanding in ever new and exciting ways.
  Read more
  - Influencing Policy
    We advise all levels of government both in the UK and across the world on the the role, impact and importance of genomic and life science research.
  - Research Policies
    Our policies help our researchers carry out their science collaboratively, equitably, ethically and responsibly.
- Admin Groups
  Admin Groups
  Browse our management and support operations teams that facilitate the Sanger Institute’s science.
  About Us
  We tackle some of the most difficult challenges in genomic research. This demands science at scale; a visionary and creative approach to research that pushes the boundaries of our understanding in ever new and exciting ways.
  Read more
- Campus Connections
  Campus Connections
  We are sited on the Wellcome Genome Campus at the very heart of a global hub of fundamental and applied genomic research, education and engagement. It is home to some of the world’s foremost institutes and organisations using genomes and biodata to deliver science with the reach, scale and imagination to solve some of humanity’s greatest challenges and maximise societal benefit.
  About Us
  We tackle some of the most difficult challenges in genomic research. This demands science at scale; a visionary and creative approach to research that pushes the boundaries of our understanding in ever new and exciting ways.
  Read more
About
We tackle some of the most difficult challenges in genomic research. This demands science at scale; a visionary and creative approach to research that pushes the boundaries of our understanding in ever new and exciting ways.
About Us
We tackle some of the most difficult challenges in genomic research. This demands science at scale; a visionary and creative approach to research that pushes the boundaries of our understanding in ever new and exciting ways.
Read more
Science
- Programmes
  Programmes
  Our research is organised into six primary Scientific Programmes, each defining a major area of research with a particular biological, disease, analytic or generative focus. In addition, our Associate Research programmes pioneer new approaches to studying health and disease.
  Science & technology
  Our science is organised into six Scientific Programmes, each defining a major area of research with a particular biological, disease, analytic or generative focus.
  Read more
- Groups
  Groups
  Browse the research, scientific and support teams in the Sanger Institute.
  Science & technology
  Our science is organised into six Scientific Programmes, each defining a major area of research with a particular biological, disease, analytic or generative focus.
  Read more
- Collaborations
  Collaborations
  Explore the national, international and global research projects and collaborations we either lead or actively contribute to.
  Science & technology
  Our science is organised into six Scientific Programmes, each defining a major area of research with a particular biological, disease, analytic or generative focus.
  Read more
- Data
  Data
  Browse our genomic and genetic data resources and repositories.
  Science & technology
  Our science is organised into six Scientific Programmes, each defining a major area of research with a particular biological, disease, analytic or generative focus.
  Read more
- Software and Resources
  Software and Resources
  We freely and openly provide the global research community with a wide range of genomics software, protocols, and platforms.
  Science & technology
  Our science is organised into six Scientific Programmes, each defining a major area of research with a particular biological, disease, analytic or generative focus.
  Read more
- Publications
  Publications
  The Sanger Institute has published papers in some of the most prestigious scientific journals. We aim to publish research that will transform biology and improve healthcare.
  Science & technology
  Our science is organised into six Scientific Programmes, each defining a major area of research with a particular biological, disease, analytic or generative focus.
  Read more
- Research Policies
  Research Policies
  Sanger Institute’s Research Policies are designed to provide guidance to help researchers navigate the legislation relating to their research and to ensure that research is ethical and legal.
  Science & technology
  Our science is organised into six Scientific Programmes, each defining a major area of research with a particular biological, disease, analytic or generative focus.
  Read more
- Archive
  Archive
  Science & technology
  Our science is organised into six Scientific Programmes, each defining a major area of research with a particular biological, disease, analytic or generative focus.
  Read more
  - COVID-19
    The Sanger Institute played a major role in the genomic surveillance of the COVID-19 pandemic, providing large-scale high-throughput sequencing of the SARS-CoV-2 virus and analysis of its evolution and spread in the UK.
  - Sanger Seminar Series
    Throughout the COVID-19 lockdown of 2020-2021 we hosted a series of monthly freely available and open virtual seminars. From using genomic approaches to map all cell types in the human body, understand how cancer develops, and track the evolution and spread of global diseases, our senior scientists and faculty presented the latest developments in their field.
Science
Our science is organised into six Scientific Programmes, each defining a major area of research with a particular biological, disease or analytic focus. In all cases, the studies provide insights into human, pathogen, cellular evolution, the phenotypic and hence biological consequences of genome variation and the processes which cause mutations.
Science & technology
Our science is organised into six Scientific Programmes, each defining a major area of research with a particular biological, disease, analytic or generative focus.
Read more
People
- Leadership
  Leadership
  Join us
  Our science is founded on the talents, imagination and curiosity of our people. Our wet-lab scientists, bioinformaticians, developers, engineers and skilled administrators work together to deliver cutting-edge research. Join us
  Read more
  - Sanger Leadership Team
    The Sanger Leadership Team is an Executive Committee that enables holistic and effective decision making, with transparency and accountability woven into its make up.
  - Governance
    How we are organised to enable transparent, open, responsive leadership.
  - Institute Scientific Advisory Board
    We draw on a number of experienced and internationally recognised scientists to provide independent scientific support, advice and challenge to help us maintain our scientific excellence.
- Faculty
  Faculty
  Join us
  Our science is founded on the talents, imagination and curiosity of our people. Our wet-lab scientists, bioinformaticians, developers, engineers and skilled administrators work together to deliver cutting-edge research. Join us
  Read more
  - Faculty
    Our Faculty conceive and deliver our science. Within our strategic framework the Institute’s scientific aspirations are driven by their vision, imagination and intellectual energy.
  - Associate Faculty
    Our Associate Faculty combine their skills and knowledge with the Sanger Institute’s unique abilities to conduct data generation and analysis at scale to pioneer genomic research in new areas.
  - International Fellows
    Our International Fellows Programme empowers early-career researchers with resources, mentorship, and funding to advance genomic research globally.
  - Honorary Faculty
    Our Honorary Faculty contribute to our research by providing insights and knowledge and collaborating on important research projects.
- All Sanger Staff
  All Sanger Staff
  Almost 1,000 scientists, developers, engineers and skilled professionals work together to deliver the Sanger Institute’s cutting-edge genomic research.
  Join us
  Our science is founded on the talents, imagination and curiosity of our people. Our wet-lab scientists, bioinformaticians, developers, engineers and skilled administrators work together to deliver cutting-edge research. Join us
  Read more
  - Science Staff
    From PhD students and Postdoctoral Fellows, bioinformaticians and laboratory managers, search for our staff who the support the delivery of pioneering science.
  - Non-Science Staff
    Look up our management and support staff that facilitate the Sanger Institute’s science.
People
Almost 1,000 scientists, developers, engineers and skilled professionals work together to deliver the Sanger Institute’s cutting-edge genomic research.
Join us
Our science is founded on the talents, imagination and curiosity of our people. Our wet-lab scientists, bioinformaticians, developers, engineers and skilled administrators work together to deliver cutting-edge research. Join us
Read more
Innovation
- Innovation at the Institute
  Innovation at the Institute
  We deliver therapeutic, diagnostic and production benefits in areas as diverse as cancer, immunology, public health and personalised treatment.
  Sanger Innovation
  We apply our science to benefit society by empowering innovators and engaging with businesses and investors - driven by Sanger Genomics Innovation
  Read more
- For Industry
  For Industry
  Find out how our translation team maximises the socioeconomic impact of the Sanger Institute’s discoveries by translating our science into products, services and technologies that benefit patients in a variety of settings.
  Sanger Innovation
  We apply our science to benefit society by empowering innovators and engaging with businesses and investors - driven by Sanger Genomics Innovation
  Read more
- For Researchers
  For Researchers
  Read how we benefit society by buiding on the innovative capabilities of our people by engaging with businesses and creating commercial opportunities. We also develop a unique and vibrant ecosystem to establish and grow innovative genomics and biodata businesses.
  Sanger Innovation
  We apply our science to benefit society by empowering innovators and engaging with businesses and investors - driven by Sanger Genomics Innovation
  Read more
- Case Studies
  Case Studies
  Read examples of how we engage with funding, R&D, service and clinical communities to promote real-world utilisation of the Sanger Institute’s technologies and resources.
  Sanger Innovation
  We apply our science to benefit society by empowering innovators and engaging with businesses and investors - driven by Sanger Genomics Innovation
  Read more
- Our Spin-Outs
  Our Spin-Outs
  We have a culture and history of scaling technologies. Some have become spin-out companies that positively impact today’s healthcare sector.
  Sanger Innovation
  We apply our science to benefit society by empowering innovators and engaging with businesses and investors - driven by Sanger Genomics Innovation
  Read more
- Sanger Technologies
  Sanger Technologies
  Browse our technology and therapy opportunities.
  Sanger Innovation
  We apply our science to benefit society by empowering innovators and engaging with businesses and investors - driven by Sanger Genomics Innovation
  Read more
- Genomics Futures Series
  Genomics Futures Series
  From February to July 2025 Wellcome Sanger Institute and Wellcome hosted the Genomics Futures workshop series, inviting a breadth of professionals including genomics researchers, industry professionals, policy makers and ethicists were invited to explore the future genomics landscape. The outputs of these workshops are available here for use, interest and to spark discussion within the wider scientific community.
  Sanger Innovation
  We apply our science to benefit society by empowering innovators and engaging with businesses and investors - driven by Sanger Genomics Innovation
  Read more
Innovation
We deliver therapeutic, diagnostic and production benefits in areas as diverse as cancer, immunology, public health and personalised treatment.
Sanger Innovation
We apply our science to benefit society by empowering innovators and engaging with businesses and investors - driven by Sanger Genomics Innovation
Read more
News
- News and Blogs
  News and Blogs
  Read our latest news and stories of the science and people of the Sanger Institute.
  News and Blogs
  News and stories Announcements, articles and blogs from the cutting edge of genomic research.
  Read more
- Press Office
  Press Office
  The Communications team promote the Sanger Institute’s research and discoveries, using both traditional media such as print, radio interviews and TV footage and social media like Twitter, Facebook and the Sanger Institute blog
  News and Blogs
  News and stories Announcements, articles and blogs from the cutting edge of genomic research.
  Read more
- Branding and Logos
  Branding and Logos
  Wellcome Sanger Institute Branding Guidelines and Logos.
  News and Blogs
  News and stories Announcements, articles and blogs from the cutting edge of genomic research.
  Read more
Latest News

4 Jun 2026

Chasing the secrets of healthy stool
Read more

3 Jun 2026

Study of millions of cells reveals new way to understand genetic risk of disease
Read more

26 May 2026

From bases to breakthroughs: Editing cancer’s weak spots with CRISPR
Read more

News and Blogs
News and stories Announcements, articles and blogs from the cutting edge of genomic research.
Read more

Wellcome Sanger Institute

Sanger Institute Science Collaboration

Manual Annotation

The HAVANA team manually annotate the human, mouse, zebrafish and other vertebrate genomes.

The HAVANA team puts special emphasis on alternatively spliced transcripts and pseudogenes, two areas still underdeveloped in automated annotation systems, as well as poly-adenylation features. Also, where other systems concentrate on, or are limited to, protein-coding genes, many HAVANA transcripts are annotated without a protein-coding region. These transcripts may function as non-coding RNAs or they may be incomplete gene fragments for which the coding sequence cannot yet be determined.

All annotated gene structures (transcripts) are supported by transcriptional evidence, either from cDNA, EST or protein sequences. As such not all annotated transcripts are necessarily complete. Support does not need to come from locus-specific evidence, but can also be homologous, paralogous or orthologous.

While the transcript and protein sequences are the most important pieces of information, HAVANA annotation takes into account and uses other data, such as CpG islands, gene predictions, repeats and genome signatures. Because the annotation software used is DAS (Distributed Annotation System) aware, the HAVANA team can link to external data sources. Ensembl gene models and data from GENCODE collaborators are some of the DAS sources the HAVANA group uses. HAVANA sources are under constant review and subject change. For example, the group recently started to use data from new technologies such as RNAseq and protein mass spectrometry in its annotation efforts.

Like its data sources, HAVANA's annotation guidelines are under constant review and are routinely updated to take into account feedback from collaborators, incorporate new data sources and reflect new trends in genetics, transcriptomics, proteomics and genomics.

HAVANA Annotation Guidelines detail our annotation standards.

All of our manual annnotation is displayed in the VEGA browser.

Our annotation is also available in Ensembl and UCSC.

External partners and funders

External

HUGO Genome Nomenclature Committee

External

Mouse Genome Informatics

External

Zebrafish Model Organism Database

External

Rat Genome Database

External

CCDS

Related groups

Science group

Vertebrate Annotation

Human Genetics

This group consists of manual annotators and software developers. The HAVANA team provides the manual annotation of human, mouse, zebrafish and ...