The NPG team liaise with production teams and faculty informatics (the customer element directly using the product) to learn of desired features for processing and common best practices. Senior DNA Pipelines management pass to NPG expected changes in the nature of the product and demand e.g. new instruments, new instrument types and new projects with expected throughput and processing requirements.
From these, small requirements are considered and typically a solution proposed, provided and reviewed by a subset of the NPG team. Larger requirements are dealt with as projects. When appropriate we’ll contact Wellcome Sanger Institute Systems asking for advice, review, new services or enhancements.
NPG reviews continuously its systems with an eye to product trends and implements long rolling projects to accommodate them. e.g. adoption of BAM format based analysis pipelines (addressing provenance records in a multiple path analysis suite, and reducing IO) and increased data streaming between analysis tools (to reduce IO requirements, avoid the need for a high-performance file-system, allow for an easily scalable system).
The NPG team has to maintain informatics and sequencing knowledge (including Illumina, PacBio and ONT) sufficient to anticipate and understand DNA Pipelines customer requirements so that requests can be anticipated, evaluated and rejected or implemented appropriately.
The Data QC team within DNA Pipelines Operations provide data QC, monitor standards and data outputs and troubleshoot issues for all Illumina® sequencing runs prior to the release and publication of data to the scientific community.
HPA has created tools which are essential components of NPG's current analysis pipelines and leads on developing and testing software for using some of the potential next generation of production sequencing technologies. HPA, in its role of creating custom assemblies, is also a consumer of NPG's data products.
Human Genetics Informatics (HGI) supports the scientific aims of the Human Genetics programme by developing and operating computational analysis workflows, managing shared storage, and providing bioinformatics software tools for the use of researchers across all Human Genetics faculty groups.
The SAM team is responsible for archival of the data produced by NPG to external repositories, typically the EGA, ENA or ArrayExpress at the EBI. They also create and maintain tools which are essential components of our analysis pipelines.