Sequencing of Idd regions in the NOD mouse genome

Sequencing of Idd regions in the NOD mouse genome

Sequencing of Idd regions in the NOD mouse genome

The Wellcome Sanger Institute undertook the genome sequencing of regions of the non-obese diabetic (NOD) mouse relevant to type 1 diabetes (T1D), also known as insulin-dependent diabetes (Idd). Comparing the sequences of Idd candidate regions between the diabetes-sensitive NOD mouse and the diabetes-resistant C57BL/6J reference mouse allowed identification of genomic variations putatively associated with diabetes in mice and, by extension, in humans. It was hoped that this research would provide a better understanding of potential immunogenomic loci responsible for the initiation and progression of autoimmune destruction of insulin-producing β-cells, eventually paving the way to potential targets in therapy.

About the Partnership

nod_mouse_jax_1.pngJackson laboratory
The non-obese diabetic mouse


T1D is a polygenic autoimmune disease that is characterised by hyperglycaemia and is fatal if not treated. It results from the progressive autoimmune T cell-mediated destruction of the insulin-producing pancreatic β-cells of the islets of Langerhans. Disease frequency is attributable to the interaction of the environment on alleles at numerous loci in the genome, with nearly 50 loci known to be involved. T1D is typically associated with specific allelic variants of the Major Histocompatability Complex (MHC) class I and class II genes within the MHC, a region of the genome that is critical for mounting immune and autoimmune responses. To date however, the MHC is the only locus that has been found to be essential for the manifestation of this disease.

The NOD mouse spontaneously develops T1D and is a model organism since it shares multiple characteristics with the human disease. Such characteristics include genetic polymorphisms that affect shared pathways, common antigenic targets, and the expression of class II MHC molecules displaying related peptides. The NOD mouse therefore represents a valuable tool for studying the genetics of T1D and for evaluating therapeutic interventions.


The NOD bacterial artificial chromosome (BAC) sequencing was funded by Immune Tolerance Network (ITN) Contract AI 15416, which was sponsored by the National Institute of Allergy and Infectious Diseases (NIAID), the National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK), and the Juvenile Diabetes Research Foundation International (JDRF).

BAC libraries

Two NOD mouse BAC libraries were constructed and the BAC ends sequenced. Clones from the DIL NOD BAC library constructed by RIKEN Genomic Sciences Centre (Japan) in conjunction with the Diabetes and Inflammation Laboratory (DIL) (University of Cambridge) from the NOD/MrkTac mouse strain are designated DIL. Clones from the CHORI-29 NOD BAC library constructed by Pieter de Jong (Children's Hospital, Oakland, California, USA) from the NOD/ShiLtJ mouse strain are designated CHORI-29.

All NOD mouse BAC end-sequences have been submitted to the International Nucleotide Sequence Database Consortium (INSDC), deposited in the NCBI trace archive. We have generated a clone map from these two libraries by mapping the BAC end-sequences to the latest assembly of the C57BL/6J mouse reference genome sequence. These BAC end-sequence alignments can then be visualized in the Ensembl mouse genome browser where the alignments of both NOD BAC libraries can be accessed through the Distributed Annotation System (DAS).

The Mouse Genomes Project has used the Illumina platform to sequence the entire NOD/ShiLtJ genome and this should help to position unaligned BAC end-sequences to novel non-reference regions of the NOD genome. Further information about the BAC end-sequences, such as their alignment, variation data and Ensembl gene coverage, can be obtained from the NOD mouse ftp site.

Library construction, sequencing and mapping details for both NOD mouse BAC libraries.

Library name

Strain nameNOD/MrkTacNOD/ShiLtJ
SourceFemale liverMale kidney

RIKEN Genomic Sciences Centre, Japan

Children's Hospital Oakland, California, USA
ContactDr. Jayne DanskaBACPAC resources
Total number of BAC clones196,032


Passed BAC clones150,87875,046
BAC clones mapped successfully125,26662,162
Passed BAC end-sequences332,535


Sanger clone prefixbQ


Average BAC insert size bp149,809205,413
Accession numbers of BAC end-sequences


mouse_strain-mhc-small.pngGenome Research Limited
Idd regions currently annotated by HAVANA

Targeted sequencing

In conjunction with external collaborators studying the genetics of NOD mice, clones covering defined Idd candidate regions were selected for whole BAC sequencing from either of the two NOD mouse libraries using the BAC end-sequence alignments in Ensembl. Sequencing was then carried out using T7 and SP6 primers on the vector, and big dye terminator chemistry. In parallel, the quality of the corresponding sequence in the C57BL/6J reference mouse has been checked. Clones from the DIL NOD BAC library constructed by RIKEN Genomic Sciences Centre (Japan) in conjunction with the Diabetes and Inflammation Laboratory (DIL) (University of Cambridge) from the NOD/MrkTac mouse strain are designated DIL. Clones from the CHORI-29 NOD BAC library constructed by Pieter de Jong (Children's Hospital, Oakland, California, USA) from the NOD/ShiLtJ mouse strain are designated CHORI-29.

Finished clones from the targeted Idd candidate regions are displayed in the NOD clone sequence section of the website, where they can be downloaded either as individual clone sequences or larger contigs that make up the accession golden path (AGP). Click here.

The importance and utility of these high quality finished sequences is demonstrated further by the essential role that the NOD/ShiLtJ strain derived CHORI-29 NOD BACs played in calibrating the variation calling software for the Mouse Genomes Project. All sequences are publicly available via the INSDC.

Manual annotation and analysis

Analysed and manually annotated sequences have been generated using in-house developed software in accordance with the manual annotation guidelines, and are available through the Vertebrate Genome Annotation browser Vega. Completed C57BL/6J annotation can also be viewed in the Vega genome browser alongside the NOD sequence. This allows comparison of the genomic sequence and genes in the candidate loci between diabetes resistant and diabetes sensitive strains, looking for example for SNPs, and is a useful way of identifying regions of difference between the two mouse strains. We have analysed the homologous C57BL/6J mouse and NOD mouse sequences and identified variation consequences. These data are available from the NOD mouse ftp site.


  • Identification of the inflammasome Nlrp1b as the candidate gene conferring diabetes risk at the Idd4.1 locus in the nonobese diabetic mouse.

    Motta VN, Markle JG, Gulban O, Mortin-Toth S, Liao KC et al.

    Journal of immunology (Baltimore, Md. : 1950) 2015;194;12;5663-73

  • The Vertebrate Genome Annotation browser 10 years on.

    Harrow JL, Steward CA, Frankish A, Gilbert JG, Gonzalez JM et al.

    Nucleic acids research 2014;42;Database issue;D771-9

  • Fine mapping of type 1 diabetes regions Idd9.1 and Idd9.2 reveals genetic complexity.

    Hamilton-Williams EE, Rainbow DB, Cheung J, Christensen M, Lyons PA et al.

    Mammalian genome : official journal of the International Mammalian Genome Society 2013;24;9-10;358-75

  • The non-obese diabetic mouse sequence, annotation and variation resource: an aid for investigating type 1 diabetes.

    Steward CA, Gonzalez JM, Trevanion S, Sheppard D, Kerry G et al.

    Database : the journal of biological databases and curation 2013;2013;bat032

  • The B10 Idd9.3 locus mediates accumulation of functionally superior CD137(+) regulatory T cells in the nonobese diabetic type 1 diabetes model.

    Kachapati K, Adams DE, Wu Y, Steward CA, Rainbow DB et al.

    Journal of immunology (Baltimore, Md. : 1950) 2012;189;10;5001-15

  • Mouse genomic variation and its effect on phenotypes and gene regulation.

    Keane TM, Goodstadt L, Danecek P, White MA, Wong K et al.

    Nature 2011;477;7364;289-94

  • Evidence that Cd101 is an autoimmune diabetes gene in nonobese diabetic mice.

    Rainbow DB, Moule C, Fraser HI, Clark J, Howlett SK et al.

    Journal of immunology (Baltimore, Md. : 1950) 2011;187;1;325-36

  • Nonobese diabetic congenic strain analysis of autoimmune diabetes reveals genetic complexity of the Idd18 locus and identifies Vav3 as a candidate gene.

    Fraser HI, Dendrou CA, Healy B, Rainbow DB, Howlett S et al.

    Journal of immunology (Baltimore, Md. : 1950) 2010;184;9;5075-84

  • Genome-wide end-sequenced BAC resources for the NOD/MrkTac() and NOD/ShiLtJ() mouse genomes.

    Steward CA, Humphray S, Plumb B, Jones MC, Quail MA et al.

    Genomics 2010;95;2;105-10

  • Ly49 cluster sequence analysis in a mouse model of diabetes: an expanded repertoire of activating receptors in the NOD genome.

    Belanger S, Tai LH, Anderson SK and Makrigiannis AP

    Genes and immunity 2008;9;6;509-21

  • Interleukin-2 gene variation impairs regulatory T cell function and causes autoimmunity.

    Yamanouchi J, Rainbow D, Serra P, Howlett S, Hunter K et al.

    Nature genetics 2007;39;3;329-37

  • Molecular genetic analysis of the Idd4 locus implicates the IFN response in type 1 diabetes susceptibility in nonobese diabetic mice.

    Ivakine EA, Gulban OM, Mortin-Toth SM, Wankiewicz E, Scott C et al.

    Journal of immunology (Baltimore, Md. : 1950) 2006;176;5;2976-90

  • Natural genetic variants influencing type 1 diabetes in humans and in the NOD mouse.

    Wicker LS, Moule CL, Fraser H, Penha-Goncalves C, Rainbow D et al.

    Novartis Foundation symposium 2005;267;57-65; discussion 65-75

  • Fine mapping, gene content, comparative sequencing, and expression analyses support Ctla4 and Nramp1 as candidates for Idd5.1 and Idd5.2 in the nonobese diabetic mouse.

    Wicker LS, Chamberlain G, Hunter K, Rainbow D, Howlett S et al.

    Journal of immunology (Baltimore, Md. : 1950) 2004;173;1;164-73

  • Identification of a structurally distinct CD101 molecule encoded in the 950-kb Idd10 region of NOD mice.

    Penha-Gonçalves C, Moule C, Smink LJ, Howson J, Gregory S et al.

    Diabetes 2003;52;6;1551-6