Data Quality Control (QC)

Scientific Operations

The Data QC team within DNA Pipelines Operations provide data QC, monitor standards and data outputs and troubleshoot issues for all Illumina® sequencing runs prior to the release and publication of data to the scientific community.

The areas in which the Data QC Team are involved in are as follows:

Data QC of all Illumina® Runs

DNA Pipelines Operations can have thousands of incoming samples weekly which are usually varied, often complex. The Data QC team are experienced at QC analysis of everything from high throughput whole genome and transcriptome sequencing through to more challenging samples such as G&T sequencing, bisulphite, Hi-C, GC/AT-rich, cancerous, malarial, single cell, pull-down from custom baits and custom primer sequencing. Each of these sample types have their own characteristics and need a broad knowledge base. Each sample type needs to be appropriately understood so we can be sure we are managing customer’s expectations and delivering the best output whilst considering any limitations for the given platform/process combination.

Monitoring standards and data outputs

The Data QC team work closely with our own in-house Scientific Service Representatives (SSRs), Production Software Development and DNA Pipelines Informatics teams and externally with Illumina® to both ensure that the data we release is the best that our customers can expect within their own research aims and for further improvements to both output and quality of data. This can include evaluating new reagents or platforms, but also examining trends within our processes and within each of our platforms – NovaSeq 6000, HiSeq X Ten, HiSeq 4000 and MiSeq. These processes require continual re-evaluation but is vital to ensure standards are maintained.

Troubleshooting of problematic runs

Whilst we have robust systems in place inevitably there are problems encountered within the sequencing runs, identified at the QC stage. We identify and understand the nature of the issue in order to determine the best course of action for resolution, working closely with other teams within DNA Pipelines. We provide troubleshooting to ensure that subsequently the correct data is sent to the appropriate customer so they can have high confidence in using it.

Our people

Core team

Elizabeth Huckle

Scientific Service Representative

Associated research

Collaborations

Collaboration

25 Genomes for 25 Years

The project's primary goal was to sequence 25 novel genomes representing UK biodiversity, as part of the Wellcome Sanger Institute' ...

Collaboration

Human Cell Atlas

The International Human Cell Atlas initiative aims to create comprehensive reference maps of all human cells—the fundamental units of life— ...

Collaboration

Vertebrate Genomes Project

The Vertebrate Genomes Project (VGP) at the Sanger Institute aims to provide reference quality assemblies for hundreds of fish, rodents and ...

Related groups

Science group

High-Throughput DNA Sequencing

Scientific Operations

The High Throughput DNA sequencing team within DNA Pipelines Operations is a highly automated high throughput team specialising in producing libraries ...

Science group

High-Throughput RNA and Laser Capture Microdissection Biopsy (LCMB) Sequencing

Scientific Operations

The High Throughput RNA and LCMB Sequencing team within DNA Pipelines Operations is a high throughput Illumina® library creation team ...

Science group

Long Read Sequencing

Scientific Operations

The long read sequencing team within DNA Pipelines Operations at the Wellcome Sanger Institute provides support for research projects requiring longer ...

Science group

New Pipeline Group (NPG)

Sequencing Informatics

NPG is responsible for the delivery of DNA Pipelines's data products and the provision of informatics expertise and QC systems.

Science group

Scientific Customer Support

Scientific Operations

The Scientific Customer Support team comprises Scientific Services Representatives (SSRs) who are the first point of contact for the Institute’s ...

Science group

Sequencing Informatics

Scientific Computing

From 2009 to 2023, the Sequencing Informatics group supported DNA Pipelines in the production of their data products, faculty's use ...

Wellcome Sanger Institute

Programmes and Facilities

Programme

DNA Sequencing

The DNA Sequencing area teams of the Wellcome Sanger Institute support the research of all scientists in their use of ...

Careers and Study

Policies

Archive

Leadership

Faculty

Data Quality Control (QC)

Data QC of all Illumina® Runs

Monitoring standards and data outputs

Troubleshooting of problematic runs

Our people

Core team

Elizabeth Huckle

Associated research

25 Genomes for 25 Years

Human Cell Atlas

Vertebrate Genomes Project

Related groups

High-Throughput DNA Sequencing

High-Throughput RNA and Laser Capture Microdissection Biopsy (LCMB) Sequencing

Long Read Sequencing

New Pipeline Group (NPG)

Scientific Customer Support

Sequencing Informatics

Programmes and Facilities

DNA Sequencing