Yu Fu, Moritz Gerstung, Spencer Phillips

Artificial intelligence finds patterns of mutations and survival in tumour images

AI detects patterns of 167 different mutations and predicts patient survival in 28 cancer types

Researchers at EMBL’s European Bioinformatics Institute (EMBL-EBI), the Wellcome Sanger Institute, Addenbrooke’s Hospital in Cambridge, UK, and collaborators have developed an artificial intelligence (AI) algorithm that uses computer vision to analyse tissue samples from cancer patients. They have shown that the algorithm can distinguish between healthy and cancerous tissues, and can also identify patterns of more than 160 DNA and thousands of RNA changes in tumours. The study, published today in Nature Cancer, highlights the potential of AI for improving cancer diagnosis, prognosis, and treatment.

Cancer diagnosis and prognosis are largely based on two main approaches. In one, histopathologists examine the appearance of cancer tissue under the microscope. In the other, cancer geneticists analyse the changes that occur in the genetic code of cancer cells. Both approaches are essential to understand and treat cancer, but they are rarely used together.

“Clinicians use microscopy slides for cancer diagnosis all the time. However, the full potential of these slides hasn’t been unlocked yet. As computer vision advances, we can analyse digital images of these slides to understand what happens at a molecular level.”

Dr Yu Fu, Postdoctoral Fellow in the Gerstung Group at EMBL-EBI

Computer vision algorithms are a form of artificial intelligence that can recognise certain features in images. Fu and colleagues repurposed such an algorithm developed by Google – originally used to classify everyday objects such as lemons, sunglasses and radiators – to distinguish various cancer types from healthy tissue. They showed that this algorithm can also be used to predict survival and even patterns of DNA and RNA changes from images of tumour tissue.

Previous studies have used similar methods to analyse images from single or a few cancer types with selected molecular alterations. However, Fu and colleagues generalised the approach on an unprecedented scale: they trained the algorithm with more than 17 000 images from 28 cancer types collected for The Cancer Genome Atlas, and studied all known genomic alterations.

“What is quite remarkable is that our algorithm can automatically link the histological appearance of almost any tumour with a very broad set of molecular characteristics, and with patient survival.”

Dr Moritz Gerstung, Group Leader at EMBL-EBI

Overall, their algorithm was capable of detecting patterns of 167 different mutations and thousands of gene activity changes. These findings show in detail how genetic mutations alter the appearance of tumour cells and tissues.

Another research group has independently validated these results with a similar AI algorithm applied to images from eight cancer types. Their study was published in the same issue of Nature Cancer.

The integration of molecular and histopathological data provides a clearer picture of a tumour’s profile. Detecting the molecular features, cell composition, and survival associated with individual tumours would help clinicians tailor appropriate treatments to their patients’ needs.

“From a clinician’s point of view, these findings are incredibly exciting. Our work shows how artificial intelligence could be used in clinical practice. While the number of cancer cases is increasing worldwide, the number of pathologists is declining. At the same time, we strive to move away from the ‘one size fits all’ approach and into personalised medicine. A combination of digital pathology and artificial intelligence can potentially alleviate those pressures and enhance our practice and patient care.”

Dr Luiza Moore, Clinician Scientist and Pathologist at the Wellcome Sanger Institute and Addenbrooke’s Hospital

Sequencing technologies have propelled genomics to the forefront of cancer research, yet these technologies remain inaccessible to most clinics around the world. A possible alternative to direct sequencing would be to use AI to emulate a genomic analysis using data that is cheaper to collect, like microscopy slides.

“Getting all that information from standard tumour images in a completely automatic manner is revolutionary. This study shows what might be possible in the coming years, but these algorithms will have to be refined before clinical implementation.”

Alexander Jung, PhD student at EMBL-EBI

More information


Fu, Y., et al. (2020). Pan-cancer computational histopathology reveals mutations, tumor composition and prognosis. Nature Cancer. DOI: 10.1038/s43018-020-0085-8


This work was supported by the Novo Nordisk Foundation, Cancer Research UK and Wellcome.

Selected websites

  • The European Bioinformatics Institute (EMBL-EBI)

    The European Bioinformatics Institute (EMBL-EBI) is a global leader in the storage, analysis and dissemination of large biological datasets. We help scientists realise the potential of ‘big data’ by enhancing their ability to exploit complex information to make discoveries that benefit humankind.

    We are at the forefront of computational biology research, with work spanning sequence analysis methods, multi-dimensional statistical analysis and data-driven biological discovery, from plant biology to mammalian development and disease.

    We are part of EMBL and are located on the Wellcome Genome Campus, one of the world’s largest concentrations of scientific and technical expertise in genomics.

    Website: www.ebi.ac.uk

  • Wellcome Sanger Institute

    The Wellcome Sanger Institute is a world-leading genomics research centre. We undertake large-scale research that forms the foundations of knowledge in biology and medicine. We are open and collaborative; our data, results, tools and technologies are shared across the globe to advance science. Our ambition is vast – we take on projects that are not possible anywhere else. We use the power of genome sequencing to understand and harness the information in DNA. Funded by Wellcome, we have the freedom and support to push the boundaries of genomics. Our findings are used to improve health and to understand life on Earth. Find out more at www.sanger.ac.uk or follow us on Twitter, Facebook, LinkedIn and on our Blog.

  • About Wellcome

    Wellcome exists to improve health by helping great ideas to thrive. We support researchers, we take on big health challenges, we campaign for better science, and we help everyone get involved with science and health research. We are a politically and financially independent foundation. https://wellcome.org/