Javascript must be enabled to continue!
Cluster efficient pangenome graph construction with nf-core/pangenome
View through CrossRef
Abstract
Motivation
Pangenome graphs offer a comprehensive way of capturing genomic variability across multiple genomes. However, current construction methods often introduce biases, excluding complex sequences or relying on references. The PanGenome Graph Builder (PGGB) addresses these issues. To date, though, there is no state-of-the-art pipeline allowing for easy deployment, efficient and dynamic use of available resources, and scalable usage at the same time.
Results
To overcome these limitations, we present
nf-core/pangenome
, a reference-unbiased approach implemented in Nextflow following nf-core’s best practices. Leveraging biocontainers ensures portability and seamless deployment in HPC environments. Unlike PGGB, nf-core/pangenome distributes alignments across cluster nodes, enabling scalability. Demonstrating its efficiency, we constructed pangenome graphs for 1000 human chromosome 19 haplotypes and 2146
E. coli
sequences, achieving a two to threefold speedup compared to PGGB without increasing greenhouse gas emissions.
Availability
nf-core/pangenome is released under the MIT open-source license, available on GitHub and Zenodo, with documentation accessible at
https://nf-co.re/pangenome/1.1.2/docs/usage
.
Contact
simon.heumos@qbic.uni-tuebingen.de
,
sven.nahnsen@qbic.uni-tuebingen.de
Title: Cluster efficient pangenome graph construction with nf-core/pangenome
Description:
Abstract
Motivation
Pangenome graphs offer a comprehensive way of capturing genomic variability across multiple genomes.
However, current construction methods often introduce biases, excluding complex sequences or relying on references.
The PanGenome Graph Builder (PGGB) addresses these issues.
To date, though, there is no state-of-the-art pipeline allowing for easy deployment, efficient and dynamic use of available resources, and scalable usage at the same time.
Results
To overcome these limitations, we present
nf-core/pangenome
, a reference-unbiased approach implemented in Nextflow following nf-core’s best practices.
Leveraging biocontainers ensures portability and seamless deployment in HPC environments.
Unlike PGGB, nf-core/pangenome distributes alignments across cluster nodes, enabling scalability.
Demonstrating its efficiency, we constructed pangenome graphs for 1000 human chromosome 19 haplotypes and 2146
E.
coli
sequences, achieving a two to threefold speedup compared to PGGB without increasing greenhouse gas emissions.
Availability
nf-core/pangenome is released under the MIT open-source license, available on GitHub and Zenodo, with documentation accessible at
https://nf-co.
re/pangenome/1.
1.
2/docs/usage
.
Contact
simon.
heumos@qbic.
uni-tuebingen.
de
,
sven.
nahnsen@qbic.
uni-tuebingen.
de.
Related Results
Disentangling the impacts of abiotic and biotic environmental factors and dispersal dynamics on the pangenome fluidity of bacterial pathogens
Disentangling the impacts of abiotic and biotic environmental factors and dispersal dynamics on the pangenome fluidity of bacterial pathogens
ABSTRACT
Understanding how pangenomes originate and evolve is crucial for predicting evolutionary trajectories and uncovering ecological interact...
Cluster-efficient pangenome graph construction with nf-core/pangenome
Cluster-efficient pangenome graph construction with nf-core/pangenome
Abstract
Motivation
Pangenome graphs offer a comprehensive way of capturing genomic variability across multiple genomes. ...
Graph convolutional neural networks for 3D data analysis
Graph convolutional neural networks for 3D data analysis
(English) Deep Learning allows the extraction of complex features directly from raw input data, eliminating the need for hand-crafted features from the classical Machine Learning p...
SVPG: A pangenome-based structural variant detection approach and rapid augmentation of pangenome graphs with new samples
SVPG: A pangenome-based structural variant detection approach and rapid augmentation of pangenome graphs with new samples
Abstract
Breakthrough advances in long-read sequencing technologies have opened unprecedented opportunities to study genetic variations through comprehensive pangen...
Pangenome graph layout by Path-Guided Stochastic Gradient Descent
Pangenome graph layout by Path-Guided Stochastic Gradient Descent
Abstract
Motivation
The increasing availability of complete genomes demands for models to study genomic variability within enti...
Constructing a VANET based on cluster chains
Constructing a VANET based on cluster chains
SUMMARYThe paper proposes a scheme on constructing a vehicular ad‐hoc network based on cluster chains. In the cluster construction algorithm, the distance from a potential cluster ...
Inheritance of Cluster Headache and its Possible Link to Migraine
Inheritance of Cluster Headache and its Possible Link to Migraine
SYNOPSIS
We evaluated the possibility that cluster headache may be a transmitted disorder, influenced by migraine genetics. In the first part of a two part study,...
Genomic characterization of the
C. tuberculostearicum
species complex, a ubiquitous member of the human skin microbiome
Genomic characterization of the
C. tuberculostearicum
species complex, a ubiquitous member of the human skin microbiome
ABSTRACT
Corynebacterium
is a predominant genus in the skin microbiome, yet its genetic diversity on skin is incompletely chara...

