nf-core/airrflow: A pipeline to analyze Adaptive Immune Receptor Repertoires (AIRRs)
Adaptive Immune Receptor Repertoire sequencing (AIRR-seq) is a sequencing technique that recovers the genetic sequences of the antigen receptors present on the surface of B and T lymphocytes. The collection of B- or T-cell receptors in an individual is referred to as the “repertoire”. AIRR-seq can be used to study the immune state of individuals, identify signatures of immune responses, and guide the development of vaccines and antibody therapies.
We developed nf-core/airrflow, a scalable Nextflow pipeline for analyzing high-throughput AIRR-seq data generated with several NGS protocols. It uses the Immcantation framework for read quality control and assembly, V(D)J assignment with IgBLAST, clonal assignment, and lineage tree reconstruction of bulk and single-cell repertoire data. The pipeline follows nf-core best practices and can be easily ported to different compute environments, including HPC clusters and commercial clouds.
Gisela Gabernet
Team Leader for Research & Development in Data Science at the Quantitative Biology Center, University of Tübingen