COSMIC, the "Catalogue Of Somatic Mutations In Cancer", is an expert-curated database encompassing the wide variety of somatic mutation mechanisms causing human cancer (http://cancer.sanger.ac.uk). Growing in both content and scope, COSMIC holds details on millions of mutations across thousands of cancer types. Hand-curation of key cancer genes (selected from the Cancer Gene Census) provides in-depth detail on mutation distributions and effects, whilst semi-automated curation of cancer genomes provides broad somatic annotations to support target discovery and the identification of patterns and signatures. This information is fully available via the website or by download, and is updated every three months.
Cancer Gene Census : This is a list of hundreds of genes with substantial published evidence in oncology. Necessarily conservative, this is a very high-confidence list based on good-quality publications. Selection of high-impact genes from this list for curation drives COSMIC.
COSMIC : Upon selection of a gene from the Census for full expert curation, all papers mentioning its mutation in human cancer are collected and exhaustively curated before the gene is released into a new version of COSMIC. Once this initial curation is released, the gene is updated as significant new information is published. Each curator is responsible for a defined set of 60 or more genes, developing substantial expertise. In parallel, cancer genomes are curated via a more bioinformatic approach. Genomic data are obtained in standard formats; roughly half come from published supplementary information tables, and the other half from genome consortia such as TCGA and ICGC. Standard pipelines (e.g. Ensembl VEP) annotate these genomic data in genic terms for COSMIC release. Such molecular profiling includes point mutations, gene fusions, copy number annotations, structural breakpoints, gene expression and CpG island methylation variants.
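As a rough illustration of the per-gene mutation distributions mentioned above, the following minimal sketch tallies mutation types per gene from a tab-separated table. The column names (`gene`, `mutation_type`), the gene labels and the sample rows are hypothetical placeholders chosen for this example; they do not reflect the actual COSMIC download format or real data.

```python
import csv
import io
from collections import Counter, defaultdict

# Hypothetical sample rows; not real COSMIC data or column names.
SAMPLE_TSV = (
    "gene\tmutation_type\n"
    "GENE_A\tmissense\n"
    "GENE_A\tmissense\n"
    "GENE_A\tnonsense\n"
    "GENE_B\tfusion\n"
)


def mutation_distribution(handle):
    """Count mutation types per gene from a tab-separated file handle."""
    counts = defaultdict(Counter)
    for row in csv.DictReader(handle, delimiter="\t"):
        counts[row["gene"]][row["mutation_type"]] += 1
    # Convert nested Counters to plain dicts for easy inspection.
    return {gene: dict(types) for gene, types in counts.items()}


if __name__ == "__main__":
    dist = mutation_distribution(io.StringIO(SAMPLE_TSV))
    for gene, types in sorted(dist.items()):
        print(gene, types)
```

To apply the same tally to a downloaded file, the `io.StringIO` handle could be replaced by an opened file, with the two column names adjusted to match whatever headers that file actually uses.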
Cancer Cell Lines Project : The Cell Lines Project in COSMIC is an effort to fully profile over 1000 cell lines regularly used in cancer research; annotations include exome sequencing, CNV and gene expression profiling, RNA-Seq and CpG methylation. This information is maintained in a separate but parallel system alongside COSMIC, and is regularly updated to highlight the most valuable information across the cell line panel.