Javascript must be enabled to continue!
Pangenome graph layout by Path-Guided Stochastic Gradient Descent
View through CrossRef
Abstract
Motivation
The increasing availability of complete genomes demands for models to study genomic variability within entire populations. Pangenome graphs capture the full genomic similarity and diversity between multiple genomes. In order to understand them, we need to see them. For visualization, we need a human readable graph layout: A graph embedding in low (e.g. two) dimensional depictions. Due to a pangenome graph’s potential excessive size, this is a significant challenge.
Results
In response, we introduce a novel graph layout algorithm: the Path-Guided Stochastic Gradient Descent (PG-SGD). PG-SGD uses the genomes, represented in the pangenome graph as paths, as an embedded positional system to sample genomic distances between pairs of nodes. This avoids the quadratic cost seen in previous versions of graph drawing by Stochastic Gradient Descent (SGD). We show that our implementation efficiently computes the low dimensional layouts of gigabase-scale pangenome graphs, unveiling their biological features.
Availability
We integrated PG-SGD in
ODGI
which is released as free software under the MIT open source license. Source code is available at
https://github.com/pangenome/odgi
.
Contact
egarris5@uthsc.edu
Title: Pangenome graph layout by Path-Guided Stochastic Gradient Descent
Description:
Abstract
Motivation
The increasing availability of complete genomes demands for models to study genomic variability within entire populations.
Pangenome graphs capture the full genomic similarity and diversity between multiple genomes.
In order to understand them, we need to see them.
For visualization, we need a human readable graph layout: A graph embedding in low (e.
g.
two) dimensional depictions.
Due to a pangenome graph’s potential excessive size, this is a significant challenge.
Results
In response, we introduce a novel graph layout algorithm: the Path-Guided Stochastic Gradient Descent (PG-SGD).
PG-SGD uses the genomes, represented in the pangenome graph as paths, as an embedded positional system to sample genomic distances between pairs of nodes.
This avoids the quadratic cost seen in previous versions of graph drawing by Stochastic Gradient Descent (SGD).
We show that our implementation efficiently computes the low dimensional layouts of gigabase-scale pangenome graphs, unveiling their biological features.
Availability
We integrated PG-SGD in
ODGI
which is released as free software under the MIT open source license.
Source code is available at
https://github.
com/pangenome/odgi
.
Contact
egarris5@uthsc.
edu.
Related Results
Cluster-efficient pangenome graph construction with nf-core/pangenome
Cluster-efficient pangenome graph construction with nf-core/pangenome
Abstract
Motivation
Pangenome graphs offer a comprehensive way of capturing genomic variability across multiple genomes. ...
Cluster efficient pangenome graph construction with nf-core/pangenome
Cluster efficient pangenome graph construction with nf-core/pangenome
Abstract
Motivation
Pangenome graphs offer a comprehensive way of capturing genomic variability across multiple genomes. Howeve...
ODGI: understanding pangenome graphs
ODGI: understanding pangenome graphs
Abstract
Motivation
Pangenome graphs provide a complete representation of the mutual alignment of collections of genomes. These...
Domination of Polynomial with Application
Domination of Polynomial with Application
In this paper, .We .initiate the study of domination. polynomial , consider G=(V,E) be a simple, finite, and directed graph without. isolated. vertex .We present a study of the Ira...
Perancangan Tata Letak Fasilitas Metode CRAFT (Computerized Relative Allocation Facility Technique)
Perancangan Tata Letak Fasilitas Metode CRAFT (Computerized Relative Allocation Facility Technique)
Abstract. The layout of production facilities is a crucial factor in supporting the smooth operation of manufacturing processes. CV. XYZ faces issues related to inefficient facilit...
PanGraphViewer: A Versatile Tool to Visualize Pangenome Graphs
PanGraphViewer: A Versatile Tool to Visualize Pangenome Graphs
AbstractPangenome graphs provide a powerful way to present both sequence and structural features in a given genome relative to the typical features of a population. There are diffe...
Haplotype Matching with GBWT for Pangenome Graphs
Haplotype Matching with GBWT for Pangenome Graphs
Traditionally, variations from a linear reference genome were used to represent large sets of haplotypes compactly. In the linear reference genome based paradigm, the positional Bu...
Efficient inference of large pangenomes with PanTA
Efficient inference of large pangenomes with PanTA
Abstract
Pangenome analysis is an indispensable step in bacterial genomics to address the high variability of bacteria genomes. However, speed an...

