Mouse Genomes Project

Data release policy

The Mouse Genomes Project releases sequence data, SNPs and other variant calls as a service to the research community.

These data are released in accordance with the Fort Lauderdale agreement and Toronto agreements. As producers of these data we reserve the right to be the first to publish a genome-wide analysis of the data we have generated.

The pre-publication data that we release via this website is embargoed for publication except for analyses of single chromosomes in single strains or single gene loci across multiple strains. We strongly encourage researchers to contact us if there are any queries about referencing or publishing analysis based on pre-publication data obtained via this website.

More information on the Wellcome Trust Sanger Institute's data sharing policy.

Enquiries

For further information, please contact mousegenomes@sanger.ac.uk

Latest News

  • 2014-10-27 New Variants Release
    New sequencing data and variants release for 28 laboratory strains. Query variants here. Raw data files available from our ftp site.
  • 2014-09-23 Upcoming new strains
    Two additional strains: BUB/BnJ and SEA/GnJ. Data processing and variant calling in progress - 28 strain website release scheduled for IMGC 2014.
  • 2014-07-08 Upcoming new strains
    C57/10J, C57BR/cdJ, C58/J, DBA/1J, I/LnJ, MOLF/EiJ, NZB/BlNJ, and NZW/LacJ. Data processing and variant calling in progress. A collaboration with Kent Hunter.
  • 2013-07-24 Accession Numbers
    We have made a minor update to this page to include the ENA accession numbers beside each strain name that correspond to the latest HiSeq 100bp sequencing data.
  • 2013-04-16 New Variants Pages
    We have launched a new interface for querying all of the sequence variants. It has been updated to include the latest variant release on GRCm38.
  • 2013-03-20 GRCm38 Release
    A new release of the BAM files, SNPs/indels, and structural variants has been made on the latest mouse reference genome (GRCm38). Please see the README files in the folders for more details. Our variant querying web page will be updated to the new release in the coming weeks.
  • 2013-02-27 Denovo Assemblies
    Preliminary scaffolds from SGA de novo assembly of 16 strains. Fasta files are in REL-1302-Assembly.These are scaffolds only and have not yet been organised into chromosomes.
  • 2013-02-12 New SV Release
    A new release of the structural variant calls has been made on our ftp site. This release incorporates the structural variation genotypes for the FVB/NJ strain. Also, we have updated the last SNP/indel release with a minor change to the VCF to fix the annotations for multi-allelic sites (the sites/genotypes are unchanged).
  • 2012-12-04 New SNP/indel Release
    A new release of the SNPs and indels for 16 of the mouse strains has been produced using the new higher quality sequencing data. The VCF files are available from the ftp site and the website query pages will be updated in the coming weeks. A new submission to dbSNP is also being prepared.
  • 2012-10-16 New Sequencing Data
    New higher quality sequencing data (100bp, ~40x, HiSeq platform) is available for 16 of the strains. This has been uploaded to the ftp site. New variant calls will be posted soon.
  • 2012-08-03 New Sequence Data
    New higher quality sequencing data (100bp, ~40x, HiSeq platform) is available on our ftp site for several of the strains - the rest will be posted soon.
  • 2012-06-12 FVB Variant Calls
    We have posted the full set of FVB variant calls (SNPs, short indels, and structural variants) on our ftp site in VCF format. The query pages on this site will be updated in the near future to incoroporate these calls.
  • 2011-11-17 FVB Strain Sequenced
    We have completed sequencing of the FVB/NJ strain to 53x sequence coverage. SNP, indel, and structural variation calls will be posted soon. The raw data is available for download from our ftp site and can browsed in LookSeq.
  • 2011-09-15 Nature Publications
    We are delighted to announce that two papers have been published in Nature describing the variation found and its impact on mouse phenotypes and traits. Further details.
  • 2011-08-05 Structural Variation Calls
    The entire set of structural variation calls across the 17 strains have been posted on our ftp site in the current_svs directory. These have also been submitted to DGVa under accession estd118.
  • 2011-06-30 Whole-brain RNA-seq BAMs
    We have just posted the BAM files and top-hat expression values for the whole-brain RNA-Seq data on our ftp site in the current_rna_bams directory. Note this data is also available from the ENA under accession ERP000614
  • 2011-06-27 Multi-tissue RNA-Seq
    We have recently completed RNA-Seq from a cross of C57BL/6J and DBA/2J across 6 different tissues and this data is available from the ENA under accession ERP000591
  • 2011-06-24 Whole-brain RNA-Seq
    We have recently completed sequencing whole-brain RNA from 15 of the strains using the Illumina RNA-Seq protocol. The data has been accessioned at the ENA under ERP000614.
  • 2011-06-24 Long Fragment end Sequencing
    We have recently completed sequencing the ends of 3kb fragments on the illumina platform across the mouse strains. The data is available from the ENA under study accession ERP000255
  • 2011-06-23 Variant Release & dbSNP
    We have made the final release of the SNP and short indel variants. The only change from the previous release (REL-1101) is that we have filtered out chrY calls. These SNPs and indels have been submitted to dbSNP as the final published set. The SNPs and short indels are available from our ftp site. The BAM alignment files have not been changed.
  • 2011-06-22 Accession Numbers
    The raw data has all been submitted to the ENA and the accession numbers and links are listed with the strain names.
  • 2010-07-27 Variant Releases
    The variant calls have been steadily updated over the past few months. The Data Release section below has links to all of the latest sets of SNP and short indel calls and the online browser has also been updated to reflect the new callsets. The BAM alignment files have not been changed.
  • 2010-01-13 New release
    The new release data have been loaded and are visible in the SNP query and LookSeq pages. 14 of the 17 strains are over 20x coverage (three over 30x) with the others (129S5, NZO, and WSB) all over 15x.
  • 2009-12-11 New Sequencing Data
    We have made a new freeze of the sequencing data with almost all of the strains sequenced to over 20x depth. This new data has now been loaded into the Lookseq viewer. We are preparing new SNP/indel calls and hope to have these searchable very soon.
  • 2009-11-03 IMGC
    Here is the poster we presented at IMGC
  • 2009-11-03 Sequencing Progress
    We now have 10 strains over 20x sequencing depth and have begun to prepare a new data release (SNP, indel, SV calls) for these strains. We'll post and update when the new data has been loaded.
* quick link - http://q.sanger.ac.uk/is84dfx7