Archived site. See the  latest event.
Luke Paul Buttigieg
Błażej Szczerba
Several Speakers

Oct 19, 2023, 6:00 PM CET

Watch on YouTube

Porting workflow managers to Nextflow at a national diagnostic genomics medical service – strategy and learnings

Community

Luke-Paul Buttigieg, Blazej Szczerba, Ricardo Humberto Ramirez Gonzalez, and Edwin Clark

Genomics England provides whole genome sequencing diagnostics to the Genomic Medicine Service (U.K); a free at the point-of-care, nationwide, genomic diagnostic testing service, with the ambitious target of processing 300,000 samples by 2025. Currently, all clinical bioinformatic analysis is processed using a clinical-standard certified, internally developed workflow engine (Bertha). We are migrating to a new solution (Genie) which combines off-the-shelf products with custom functionality, so we can focus on our core mission to enable equitably accessed, genomics medicine for all. Genie should help us support newer use cases quicker, across different infrastructures (cloud and on-premise), and uses a standard workflow definition language.

We have developed an approach to migrate at speed in an agile and iterative fashion. The initial phase involves using the same Singularity image containing the bioinformatics workflow’s logic that Bertha uses, directly in Genie. This reduces the risk of divergence. We are using an automated comparison testing framework to compare the existing system with the new one to detect regressions. Later, we will iteratively refactor the workflows, breaking up the Singularity image and optimising for performance. We will focus on making the workflows portable, standardised, de-coupled and optimised for executing in the cloud with Nextflow.

In this talk, we will describe this migration strategy, risk management, refactoring strategy and lessons learnt while working through this large-scale effort.

Watch on YouTube
Luke Paul Buttigieg

Luke Paul Buttigieg

Senior Bioinformatics Engineer at Genomics England Ltd.

Community
Speaker
Błażej Szczerba

Błażej Szczerba

Senior Software Engineer at Ardigen

Ecosystem
Speaker
Ricardo Humberto Ramirez Gonzalez

Ricardo Humberto Ramirez Gonzalez

Senior Bioinformatics Engineer at Genomics England

Software
Speaker