Genomics API supports Global Alliance for Genomics and Health

Global Alliance for Genomics and Health unveils new genomics API allowing for seamless sharing of genetic data

Email newsletter

News and blog updates

Sign up

Global Alliance for Genomics and Health
Interoperability: One API, many apps. The API (application programming interface) will allow the interoperable exchange of genomic information across multiple organisations and on multiple platforms.
On 31 July 2014 the Global Alliance for Genomics and Health (GA4GH) announced a new Application Programming Interface (API) developed by the Global Alliance’s Data Working Group that will allow DNA data providers and consumers to better share information and work together on a global scale.

This new open source Genomics API, referred to as Version 0.5, is a standard, open tool promoting data interoperability, and will be part of a suite of Genomics APIs being developed by the Global Alliance. The API enables the interoperable exchange of information contained in DNA sequence reads across multiple organizations and on multiple platforms.

The new Genomics API is one of the first products to be developed and distributed by the Global Alliance for Genomics and Health, which was formed only one year ago and is made up of over 200 of the world’s leading biomedical research institutions, healthcare providers, information technology and life science companies, funders of research, and disease and patient advocacy organisations.

“This new Genomics API is an exciting step toward interoperability in genomic data. It advances the Global Alliance’s mission of enabling the sharing of genomic and clinical data to improve human health. Because this new API lets researchers work consistently with genomic data across institutions and platforms, it will help realise the benefits that come from large-scale genomic data sharing, allowing us to find the needle in the haystack for patients with rare diseases.”

David Haussler Co-Chair of the Global Alliance’s Data Working Group and Scientific Director of the UC Santa Cruz Genomics Institute

Promoting the Global Alliance’s goals of transparency and collaboration, this new Genomics API Version 0.5 uses an open development process to allow the wider bioinformatics community to participate. While the Data Working Group has a core team of active developers, all interested developers from any institution can further engage with this platform by exploring sample apps, building implementations from scratch or from existing samples, or by providing feedback on the API and its documentation. The interface is managed in an open Global Alliance developer site.

The newly announced Genomics API Version 0.5 builds off of the successful Version 0.1, which was also developed by members of the Data Working Group and is in use by leading organisations, including the European Bioinformatics Institute (EMBL-EBI), the U.S. National Center for Biotechnology Information (NCBI), Google, Genome Savant, and Harvard Medical School’s Biomedical Cybernetics Laboratory, powering a growing community of applications. As analysis tools adopt the new API, researchers will be able to extend their own infrastructure to utilise cloud resources, such as those available from Amazon Web Services, Google Cloud Platform, and Microsoft Azure.

The GA4GH Genomics API is built on the file formats developed over the last five years for large-scale genomic sequencing projects, now also managed by the Global Alliance, but features cleaner models, with a modern, easy-to-use data description schema and a web-enabled interface.

“Genome sequencing is transitioning from being a powerful research tool to making an enormous impact in clinical diagnostics and care. This API from the Global Alliance Data Working Group will enable genomic data processing to move beyond research file formats into modern computing and data architectures, facilitating controlled data sharing and the effective use of these new technologies for both clinical and research benefit.”

Dr Richard Durbin Acting Head of Computational Genomics at the Wellcome Trust Sanger Institute and leader of the Genome Informatics group

“Modern DNA sequencing, when coupled with modern data and cloud technology, can lead to breakthroughs in understanding and improving human health. This new Genomics API is a big step forward. Google already supports Version 0.1 of the API, and we’ll be adding support for Version 0.5 soon, as well as continuing to contribute to the Data Working Group.”

David Glazer Co-chair of the Reads Task Team and Engineering Director for Google Cloud Platform and Google Genomics

“The Global Alliance is breaking new ground in combining genomic sequencing and clinical care. Amazon Web Services is proud to support these efforts, and help in defining new operating models, such as the latest Genomics API. We view these new APIs as a vital component for collaboration and development of next-generation tools that can run cost-effectively at massive scale.”

Matt Wood General Manager of Data Science, Amazon Web Services, Inc.

“We are using the Global Alliance’s work to enable apps for the TBResist initiative that bridge from raw sequence data to clinically useful phenotypes. Also, the Substitutable Medical Applications and Reusable Technology (SMART) Genomics platform is using the Global Alliance interface to enable interoperability between electronic medical record information (HL7) and raw genetic sequence information.”

Professor Gil Alterovitz A faculty at the Harvard Medical School and director of the Biomedical Cybernetics Laboratory

Other Working Groups of the Global Alliance for Genomics and Health are currently identifying best practices to integrate genomic data into clinical practice, reaching agreement on security protocols, and developing a framework to address ethics and regulatory considerations.

More information

Selected websites

  • Global Alliance for Genomics and Health

    The Global Alliance for Genomics and Health is an international, non-profit alliance formed to help accelerate the potential of genomic medicine to advance human health. Bringing together over 200 leading institutions working in healthcare, research, disease and patient advocacy, life science, and information technology, partners in the Global Alliance are working together to create a common framework of standards and harmonised approaches to enable the responsible, voluntary, and secure sharing of genomic and clinical data. Learn more at: http://genomicsandhealth.org

  • The Wellcome Trust Sanger Institute

    The Wellcome Trust Sanger Institute is one of the world’s leading genome centres. Through its ability to conduct research at scale, it is able to engage in bold and long-term exploratory projects that are designed to influence and empower medical science globally. Institute research findings, generated through its own research programmes and through its leading role in international consortia, are being used to develop new diagnostics and treatments for human disease.

  • The Wellcome Trust

    The Wellcome Trust is a global charitable foundation dedicated to achieving extraordinary improvements in human and animal health. We support the brightest minds in biomedical research and the medical humanities. Our breadth of support includes public engagement, education and the application of research to improve health. We are independent of both political and commercial interests.