Javascript must be enabled to continue!
Pangenome graph layout by Path-Guided Stochastic Gradient Descent
View through CrossRef
Abstract
Motivation
The increasing availability of complete genomes demands for models to study genomic variability within entire populations. Pangenome graphs capture the full genomic similarity and diversity between multiple genomes. In order to understand them, we need to see them. For visualization, we need a human readable graph layout: A graph embedding in low (e.g. two) dimensional depictions. Due to a pangenome graph’s potential excessive size, this is a significant challenge.
Results
In response, we introduce a novel graph layout algorithm: the Path-Guided Stochastic Gradient Descent (PG-SGD). PG-SGD uses the genomes, represented in the pangenome graph as paths, as an embedded positional system to sample genomic distances between pairs of nodes. This avoids the quadratic cost seen in previous versions of graph drawing by Stochastic Gradient Descent (SGD). We show that our implementation efficiently computes the low dimensional layouts of gigabase-scale pangenome graphs, unveiling their biological features.
Availability
We integrated PG-SGD in
ODGI
which is released as free software under the MIT open source license. Source code is available at
https://github.com/pangenome/odgi
.
Contact
egarris5@uthsc.edu
Title: Pangenome graph layout by Path-Guided Stochastic Gradient Descent
Description:
Abstract
Motivation
The increasing availability of complete genomes demands for models to study genomic variability within entire populations.
Pangenome graphs capture the full genomic similarity and diversity between multiple genomes.
In order to understand them, we need to see them.
For visualization, we need a human readable graph layout: A graph embedding in low (e.
g.
two) dimensional depictions.
Due to a pangenome graph’s potential excessive size, this is a significant challenge.
Results
In response, we introduce a novel graph layout algorithm: the Path-Guided Stochastic Gradient Descent (PG-SGD).
PG-SGD uses the genomes, represented in the pangenome graph as paths, as an embedded positional system to sample genomic distances between pairs of nodes.
This avoids the quadratic cost seen in previous versions of graph drawing by Stochastic Gradient Descent (SGD).
We show that our implementation efficiently computes the low dimensional layouts of gigabase-scale pangenome graphs, unveiling their biological features.
Availability
We integrated PG-SGD in
ODGI
which is released as free software under the MIT open source license.
Source code is available at
https://github.
com/pangenome/odgi
.
Contact
egarris5@uthsc.
edu.
Related Results
Disentangling the impacts of abiotic and biotic environmental factors and dispersal dynamics on the pangenome fluidity of bacterial pathogens
Disentangling the impacts of abiotic and biotic environmental factors and dispersal dynamics on the pangenome fluidity of bacterial pathogens
ABSTRACT
Understanding how pangenomes originate and evolve is crucial for predicting evolutionary trajectories and uncovering ecological interact...
Cluster-efficient pangenome graph construction with nf-core/pangenome
Cluster-efficient pangenome graph construction with nf-core/pangenome
Abstract
Motivation
Pangenome graphs offer a comprehensive way of capturing genomic variability across multiple genomes. ...
Cluster efficient pangenome graph construction with nf-core/pangenome
Cluster efficient pangenome graph construction with nf-core/pangenome
Abstract
Motivation
Pangenome graphs offer a comprehensive way of capturing genomic variability across multiple genomes. Howeve...
SVPG: A pangenome-based structural variant detection approach and rapid augmentation of pangenome graphs with new samples
SVPG: A pangenome-based structural variant detection approach and rapid augmentation of pangenome graphs with new samples
Abstract
Breakthrough advances in long-read sequencing technologies have opened unprecedented opportunities to study genetic variations through comprehensive pangen...
Graph convolutional neural networks for 3D data analysis
Graph convolutional neural networks for 3D data analysis
(English) Deep Learning allows the extraction of complex features directly from raw input data, eliminating the need for hand-crafted features from the classical Machine Learning p...
Genomic characterization of the
C. tuberculostearicum
species complex, a ubiquitous member of the human skin microbiome
Genomic characterization of the
C. tuberculostearicum
species complex, a ubiquitous member of the human skin microbiome
ABSTRACT
Corynebacterium
is a predominant genus in the skin microbiome, yet its genetic diversity on skin is incompletely chara...
Bilangan Terhubung Titik Pelangi pada Graf Garis dan Graf Tengah dari Hasil Operasi Comb Graf Bintang C<sub>3</sub> dan Graf Bintang S<sub>n</sub>
Bilangan Terhubung Titik Pelangi pada Graf Garis dan Graf Tengah dari Hasil Operasi Comb Graf Bintang C<sub>3</sub> dan Graf Bintang S<sub>n</sub>
Penelitian ini bertujuan menentukan bilangan terhubung titik pelangi (rainbow vertex connection number) pada graf garis dan graf tengah yang diperoleh dari hasil operasi comb antar...
ODGI: understanding pangenome graphs
ODGI: understanding pangenome graphs
Abstract
Motivation
Pangenome graphs provide a complete representation of the mutual alignment of collections of genomes. These...

