Researchers Expand Efforts to Explore Functional Landscape of the Human Genome

Full-Scale ENCODE Project Will Survey Entire Human Instruction Book

Researchers Expand Efforts to Explore Functional Landscape of the Human Genome

The pipeline of the ENCODE pilot, GENCODE. This schematic diagram shows the flow of data among the three groups involved in the GENCODE consortium - HAVANA (Sanger Institute), IMIM (Institut Municipal d'Investigació Mèdica) and Geneva/Lausanne (Universities of Geneva and Lausanne) - to produce an experimentally verified annotation of the ENCODE pilot regions.

The Wellcome Trust Sanger Institute has been awarded $8.5M (£4.2M) by the US National Human Genome Research Institute (NHGRI) as part of a programme to expand the ENCyclopedia Of DNA Elements (ENCODE) project, which in its pilot phase yielded provocative new insights into the organization and function of the human genome.

Dr Tim Hubbard, Head of Informatics at the Sanger Institute, will lead efforts identify gene features across all 3 billion bases of DNA code in the human genome, the only scale-up programme awarded funding that is based outside the US.

"The pilot phase of ENCODE revealed a great deal, yet showed we still have much to learn about the content and the actions of the human genome. By expanding our efforts to the entire genome, we expect to discover many more genome features that will help us to understand human genome biology and the role of genome elements in health and disease."

Dr Tim Hubbard, Head of Informatics at the Sanger Institute​

Dr Hubbard will lead an international team that will integrate a range of computational and informational studies of genes to develop accurate 'annotation' of the genome - the description of gene locations and structures. The project team will provide additional information about the strength of gene annotation and, by combining informatics and experimental approaches, will enhance the quality of catalogue of human genes.

"We have confidence that through this integrated project we will be able to eliminate many of the remaining uncertainties about the precise location of genes, their components and transcript structure in the human genome."

Dr Tim Hubbard, Head of Informatics at the Sanger Institute​

The project will build on the manual gene annotation work of the Sanger HAVANA group, led by Dr Jen Harrow, which played a major part in the ENCODE pilot and the automatically generated human gene annotation from the joint Sanger/EBI Ensembl project.

The new $80M ENCODE programme is a multidisciplinary programme that brings a range of cutting-edge research methods to the study of the genome, which were examined in the pilot phase. In addition to the research grants to support expansion of the ENCODE project, NHGRI also announced awards today for two pilot-scale projects, the establishment of an ENCODE data coordination centre and six projects to develop novel methods and technologies aimed at helping the ENCODE project achieve its goals.

"Based on ENCODE's early success, we are moving forward with a full-scale initiative to build a parts list of biologically functional elements in the human genome. The ENCODE pilot produced findings that are reshaping many long-held views about our genome. ENCODE's effort to survey the entire genome will uncover even more exciting surprises, providing us with a more complete picture of the biological roots of human health and disease."

NHGRI Director Francis S. Collins, MD, PhD

In June, the ENCODE research consortium published a set of landmark papers in the journals Nature and Genome Research that found the organization, function and evolution of the genome to be far more complicated than most had suspected. For example, while researchers have traditionally focused on studying genes and their associated proteins, the ENCODE data indicate the genome is a very complex, interwoven network in which genes are just one of many types of DNA sequences with functional impact.

"As was the case for the Human Genome Project and the ENCODE pilot, all of the data generated by the full-scale ENCODE project will be deposited into public databases as soon as they are experimentally verified. Free and rapid access to this data will enable researchers around the world to pose new questions and gain new insights into how the human genome functions."

Peter Good PhD, programme director for genome informatics in NHGRI's Division of Extramural Research

Notes to Editors
ENCODE Scale-up Grantees (total $65.6M)
  • Bradley Bernstein, Broad Institute of MIT and Harvard, Cambridge, MA, USA; High-Throughput Sequencing of Chromatin Regulatory Elements
  • Gregory Crawford, Duke University Institute for Genome Sciences & Policy, Durham, NC, USA; Comprehensive Identification of Active Functional Elements in Human Chromatin
  • Thomas Gingeras, Affymetrix Inc., Santa Clara, CA, USA; Comprehensive Characterization and Classification of the Human Transcriptome
  • Tim Hubbard, Wellcome Trust Sanger Institute, Hinxton, UK; Integrated Human Genome Annotation: Generation of a Reference Gene Set
  • Richard Myers, Stanford University, Stanford, CA, USA; Global Annotation of Regulatory Elements in the Human Genome
  • Michael Snyder, Yale University, New Haven, CN, USA; Production Center for Global Mapping of Regulatory Elements
  • John Stamatoyannopoulos, University of Washington, Seattle, WA, USA; A Comprehensive Catalog of Human DNase I Hypersensitive Sites

Other funding is awarded for data coordination, additional pilot projects and new technology development

Participating Centres and principle investigators in the Integrated Annotation Programme
  • Tim Hubbard, Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge CB10 1SA, UK
  • Alexandre Reymond, The University of Lausanne, Switzerland
  • Roderic Guigo, Centre de Regulació Genòmica (CRG), Barcelona, Catalonia, Spain
  • David Haussler, University of California, Santa Cruz (UCSC) Santa Cruz, California, USA
  • Michael Brent, Washington University, St Louis (WashU), St Louis, USA
  • Manolis Kellis, Massachusetts Institute of Technology (MIT), Boston, USA
  • Mark Gerstein, Yale University (Yale), New Haven, USA
  • Alfonso Valencia, Spanish National Cancer Research Centre (CNIO), Madrid, Spain
Selected Websites
Contact the Press Office

Emily Mobley, Media Manager

Tel +44 (0)1223 496 851

Dr Samantha Wynne, Media Officer

Tel +44 (0)1223 492 368

Dr Matthew Midgley, Media Officer

Tel +44 (0)1223 494 856

Wellcome Sanger Institute,
CB10 1SA,

Mobile +44 (0) 7748 379849

Recent News

From childhood cancer to mapping human development: Pioneering scientist awarded 2019 Foulkes Foundation medal
Congratulations to Professor Muzlifah Haniffa, who has been recognised for her work on the immune system and childhood cancer
Dr Sam Behjati wins Liddy Shriver Early Career Research Award
Congratulations to Sanger Institute cancer researcher, Dr Sam Behjati, who has been recognised for his work in sarcoma cancer genomics
New malaria drug targets identified in liver stage of life cycle
Hope that new liver-stage drugs will help to counter the threat of antimicrobial resistance to current blood-stage medicines