New tool allows analysis of single-cell RNA data in pre-malignant tumours

Single Cell Consensus Clustering (SC3) tool more accurate and robust than previous methods for sorting cells into groups

New tool allows analysis of single-cell RNA data in pre-malignant tumours

Using SC3 to define subclones from two patients with myeloproliferative neoplasm. Credit: Nature Methods DOI: 10.1038/nmeth.4236

Wellcome Trust Sanger Institute scientists and their collaborators have developed a new analysis tool that was able to show, for the first time, which genes were expressed by individual cells in different genetic versions of a benign blood cancer.

Single cell RNA sequencing can define cell types by revealing differences in the proteins produced by individual cells, however analysing the data remains challenging.  Reported in Nature Methods today, the new open source computer tool called Single Cell Consensus Clustering (SC3) was shown to be more accurate and robust than existing methods of analysing single-cell RNA sequence data, and is freely available for researchers to use*.

Recent advances in single-cell genomics technology has made it possible to separate individual cells from different tissues and organs, and measure the sets of RNA messages - called the transcriptome - which help give each cell its own identity. These individual transcriptomes can be used to define cell types and to understand the functions of healthy and diseased cells in the human body. This technology has enormous potential for biological research.

In order to analyse the transcriptomic data, similar cells need to be grouped together.  However, it is hard to know what criteria to use to group them, and the data is often very complex.  The researchers developed the SC3 computer tool to overcome these problems and validated it using several publicly available gold standard datasets.

“We created the new SC3 tool to analyse complex single-cell RNA-sequence data, and showed that it is more robust and accurate than existing methods at grouping cells.  The SC3 tool contains added features that help interpret the biological function of the cells in that group, such as lists of marker genes for each group.  We expect this will be used by many researchers around the world.”

Dr Vladimir Yu Kiselev, first author from the Sanger Institute

The SC3 tool was then used to analyse single-cell RNA-sequence data from two patients diagnosed with myeloproliferative neoplasm (MPN) blood cancers.  Pre-malignant MPN occurs when the bone marrow makes too many blood cells, and in 10 per cent of patients can lead to overt leukaemia.

Patients often have multiple versions of the cancer, called subclones, which have different mutations, and the researchers wanted to find if the expression levels of RNA correlated with the different mutations. Previous attempts to analyse the RNA datasets with other methods had failed, however SC3 was able to resolve the datasets and showed that each cancer-causing mutation led to different proteins being expressed.

“The SC3 tool was able to use patterns of gene expression to distinguish, within an individual cancer, subclones that carried different mutations. This approach will help us define the cellular heterogeneity within each cancer, an important step towards improving cancer treatment."

Professor Tony Green, an author from the Wellcome Trust-MRC Stem Cell Institute and Cambridge University

“It has been difficult to fully exploit single-cell RNA-sequence data due to the current lack of computational methods for analysing them. Our study shows that SC3 is an accurate and user-friendly tool, which can analyse complex datasets. We hope that this tool will help researchers gain new biological insights from transcriptome datasets in the future and provide information for diseases that affect specific cell types.”

Dr Martin Hemberg, lead author on the paper from the Wellcome Trust Sanger Institute

Notes to Editors
  • SC3: consensus clustering of single-cell RNA-seq data.

    Kiselev VY, Kirschner K, Schaub MT, Andrews T, Yiu A et al.

    Nature methods 2017;14;5;483-486

*SC3 is available from and the source code can be found at


This work is supported by Bloodwise (grant ref. 13003), the Wellcome Trust (grant ref. 104710/Z/14/Z), the Medical Research Council, the Kay Kendall Leukaemia Fund, the Cambridge NIHR Biomedical Research Center, the Cambridge Experimental Cancer Medicine Centre, the Leukemia and Lymphoma Society of America (grant ref. 07037), and a core support grant from the Wellcome Trust and MRC to the Wellcome Trust Medical Research Council Cambridge Stem Cell Institute.

Selected Websites
Contact the Press Office

Dr Samantha Wynne

Media Officer

Wellcome Trust Sanger Institute,
CB10 1SA,

Tel +44 (0)1223 492 368

Mobile +44 (0) 7900 607793

Fax +44 (0)1223 494 919

Recent News

Isolated Greek villages reveal genetic secrets that protect against heart disease

A genetic variant that protects the heart against cardiovascular disease has been discovered by researchers at the Wellcome Trust Sanger Institute and their collaborators.

Natural resistance to malaria linked to variation in human red blood cell receptors

First study to identify protective effect of glycophorin gene rearrangements on malaria

Scientists unveil the UK’s largest resource of human stem cells from healthy donors

Powerful resource created for scientists studying human development and disease