Contact WTSI Webmaster Printer friendly format Login to WTSI resources WTSI RSS feed
Scientific Divisions
  • Human Genetics
  • Model Organisms
  • Pathogens
  • Bioinformatics
  • Sequencing
  • Genevar
  • Home
  • Data
  • Website Search
  • People Search
  • Library Services
  • Site Map
  • Feedback / Help
GENEVAR - GENe Expression VARiation

Analysis of gene expression variation in the HapMap samples using genome-wide expression arrays.

We have produced a dataset of gene expression data from EBV-transformed lymphoblastoid cell lines from all 270 HapMap individuals used in the phase I and phase II of the project (populations: CEU, CHB, JPT and YRI).

This is a collaborative effort that involves the groups of:

Groups
  • Manolis Dermitzakis, Wellcome Trust Sanger Institute
  • Panos Deloukas, Wellcome Trust Sanger Institute
  • Simon Tavare, University of Cambridge
  • Andrew Clark, Cornell University

Our groups aim at performing various types of global analysis in this dataset such as:

Analysis
  • associations to SNP and haplotype variation
  • associations to copy number variants (CNVs)
  • identify functionally variable regulatory regions
  • study regulatory networks
  • quantify population differentiation using expression phenotype data

These data are being released freely to the scientific community and can be considered a community resource. However, the data generators reserve the right to be the first to publish on the bulk data as indicated by the Fort Lauderdale meeting report (see data release policy below).

A pilot effort of this study with a small set of genes has been published in:

Genome-wide associations of gene expression variation in humans.
Stranger BE, Forrest MS, Clark AG, Minichiello MJ, Deutsch S, Lyle R, Hunt S, Kahl B, Antonarakis SE, Tavaré S, Deloukas P, Dermitzakis ET
PLoS Genet. 2005;1;e78. PMID: 16362079 DOI: 10.1371/journal.pgen.0010078
Relative impact of nucleotide and copy number variation on gene expression phenotypes.
Stranger BE, Forrest MS, Dunning M, Ingle CE, Beazley C, Thorne N, Redon R, Bird CP, de Grassi A, Lee C, Tyler-Smith C, Carter N, Scherer SW, Tavaré S, Deloukas P, Hurles ME, Dermitzakis ET
Science. 2007;315;848-53. PMID: 17289997 DOI: 10.1126/science.1136678
Population genomics of human gene expression.
Stranger BE, Nica AC, Forrest MS, Dimas A, Bird CP, Beazley C, Ingle CE, Dunning M, Flicek P, Koller D, Montgomery S, Tavaré S, Deloukas P, Dermitzakis ET
Nat Genet. 2007;39;1217-24. PMID: 17873874 DOI: 10.1038/ng2142

For any questions regarding the data and the data release policy please contact:
Manolis Dermitzakis (md4@sanger.ac.uk)
Barbara Stranger (bes@sanger.ac.uk)

The data was generated using illumina genome-wide expression arrays (wg GEX human 6). Each individual has 4 replicate hybridizations.

NEWS
Site Updates
  • Sept 16, 2007: Paper published in Nature Genetics
  • April 17, 2007: Data normalized across all populations available
  • February 9, 2007: Paper published in Science
  • June 1, 2006: Normalized gene expression data available
  • May 10, 2006: Raw gene expression data available
DATA DOWNLOAD

Raw and normalized data sets as well as a file with transcript information can be downloaded here.
The files of the normalized data are now available.

ACKNOWLEGDMENTS

Stranger B. E., Forrest M. S. , Dunning M. , Ingle C., Minichiello M. J., Kahl B., Clark A. G., Tavare S., Deloukas P., Dermitzakis E. T.

Funding was provided by the Wellcome Trust.

DATA RELEASE POLICY

The release of pre-publication data from large resource-generating scientific projects was the subject of a meeting held in January 2003, the "Fort Lauderdale meeting", sponsored by the Wellcome Trust, one of the Project funders. The report from that meeting can be viewed here.

The recommendations of the Fort Lauderdale meeting address the roles and responsibilities of data producers, data users, and funders of "community resource projects", with the aim of establishing and maintaining an appropriate balance between the interests of data users in rapid access to data and the needs of data producers to receive recognition for their work. The conclusion of the attendees at the meeting was that responsible use of the data is necessary to ensure that first-rate data producers will continue to participate in such projects and produce and quickly release valuable large-scale data sets. "Responsible use" was defined as allowing the data producers to have the opportunity to publish the initial global analyses of the data, as articulated at the outset of the project. Doing so also will ensure that the data generated are fully described.

Human Genetics Model Organisms Pathogen Biology Bioinformatics Sequencing
Section Home
Cancer Genome Project
COSMIC
Statistical Genetics
Human Genome Project
Case-Control Consortium
Section Home
Mouse
Zebrafish
C. elegans
S. pombe
Section Home
Bacteria
Protozoa
Helminths
Section Home
Software
Databases
Blast
Ensembl
Vega
GeneDB
Section Home
Sequencing Projects
sequencing Information

webmaster@sanger.ac.uk

Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK  Tel:+44 (0)1223 834244

Last Modified Fri Oct 5 17:41:01 2007

Genome Research Limited is a charity registered in England with number 1021457

Data Sharing Policy | Conditions of Use | Copyright