Javascript must be enabled to continue!
Metagenomic binning with assembly graph embeddings
View through CrossRef
Abstract
Despite recent advancements in sequencing technologies and assembly methods, obtaining high-quality microbial genomes from metagenomic samples is still not a trivial task. Current metagenomic binners do not take full advantage of assembly graphs and are not optimized for long-read assemblies. Deep graph learning algorithms have been proposed in other fields to deal with complex graph data structures. The graph structure generated during the assembly process could be integrated with contig features to obtain better bins with deep learning.
We propose GraphMB, which uses graph neural networks to incorporate the assembly graph into the binning process. We test GraphMB on long-read datasets of different complexities, and compare the performance with other binners in terms of the number of High Quality (HQ) genome bins obtained. With our approach, we were able to obtain unique bins on all real datasets, and obtain more bins on most datasets. In particular, we obtained on average 17.5% more HQ bins when compared to state-of-the-art binners and 13.7% when aggregating the results of our binner with the others. These results indicate that a deep learning model can integrate contig-specific and graph-structure information to improve metagenomic binning. GraphMB is available from
https://github.com/MicrobialDarkMatter/GraphMB
.
Title: Metagenomic binning with assembly graph embeddings
Description:
Abstract
Despite recent advancements in sequencing technologies and assembly methods, obtaining high-quality microbial genomes from metagenomic samples is still not a trivial task.
Current metagenomic binners do not take full advantage of assembly graphs and are not optimized for long-read assemblies.
Deep graph learning algorithms have been proposed in other fields to deal with complex graph data structures.
The graph structure generated during the assembly process could be integrated with contig features to obtain better bins with deep learning.
We propose GraphMB, which uses graph neural networks to incorporate the assembly graph into the binning process.
We test GraphMB on long-read datasets of different complexities, and compare the performance with other binners in terms of the number of High Quality (HQ) genome bins obtained.
With our approach, we were able to obtain unique bins on all real datasets, and obtain more bins on most datasets.
In particular, we obtained on average 17.
5% more HQ bins when compared to state-of-the-art binners and 13.
7% when aggregating the results of our binner with the others.
These results indicate that a deep learning model can integrate contig-specific and graph-structure information to improve metagenomic binning.
GraphMB is available from
https://github.
com/MicrobialDarkMatter/GraphMB
.
Related Results
CoCoBin: Graph-Based Metagenomic Binning via Composition–Coverage Separation
CoCoBin: Graph-Based Metagenomic Binning via Composition–Coverage Separation
Abstract
Motivation
Metagenomic binning is a critical step in metagenomic analysis, aiming to cluster contigs from the same genome into c...
GraphK-LR: Enhancing Long-read Metagenomic Binning with Read-overlap Graphs Across Microbial Kingdoms
GraphK-LR: Enhancing Long-read Metagenomic Binning with Read-overlap Graphs Across Microbial Kingdoms
Abstract
Background: Metagenomics, the study of genetic material from environmental samples, relies on binning - the process of grouping DNA sequences from the same organis...
Evaluation of metagenome binning: advances and challenges
Evaluation of metagenome binning: advances and challenges
Abstract
Several recent deep learning methods for metagenome binning claim improvements in the recovery of high-quality metagenome-assembled genomes. These method...
MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies
MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies
We previously reported MetaBAT, an automated metagenome binning software tool to reconstruct single genomes from microbial communities for subsequent analyses of uncultivated micro...
Evaluation of Metagenome Binning: Advances and Challenges
Evaluation of Metagenome Binning: Advances and Challenges
Abstract
Background
Several recent deep learning methods for metagenome binning claim improvements in the recovery of high qual...
Graph convolutional neural networks for 3D data analysis
Graph convolutional neural networks for 3D data analysis
(English) Deep Learning allows the extraction of complex features directly from raw input data, eliminating the need for hand-crafted features from the classical Machine Learning p...
BusyBee Web: towards comprehensive and differential composition-based metagenomic binning
BusyBee Web: towards comprehensive and differential composition-based metagenomic binning
Abstract
Despite recent methodology and reference database improvements for taxonomic profiling tools, metagenomic assembly and genomic binning remain important pill...
Effect of data binning and frame averaging for micro-CT image acquisition on the morphometric outcome of bone repair assessment
Effect of data binning and frame averaging for micro-CT image acquisition on the morphometric outcome of bone repair assessment
AbstractDespite the current advances in micro-CT analysis, the influence of some image acquisition parameters on the morphometric assessment outcome have not been fully elucidated....

