Javascript must be enabled to continue!
CoCoBin: Graph-Based Metagenomic Binning via Composition–Coverage Separation
View through CrossRef
Abstract
Motivation
Metagenomic binning is a critical step in metagenomic analysis, aiming to cluster contigs from the same genome into coherent groups. In contemporary workflows, most binning tools begin with the assembly of shotgun metagenomic sequencing data. The assembled contigs are then grouped into bins representing individual microbial genomes or species, typically using taxonomy-independent methods. Although several methods exist, metagenomic binning remains a challenging yet mandatory task, particularly in the context of complex and highly diverse microbial communities.
Results
We propose CoCoBin, a novel metagenomic binning tool explicitly designed for the effective binning of metagenomic contigs. In this study, we introduced an innovative approach for calculating contig similarity by separating composition and coverage information. The method begins by (1) assigning contigs into a cluster based on length ranges, (2) calculating contig similarity based on composition features (e.g., k-mer frequencies), and (3) calculating contig difference based on coverage features. These similarity measures are then integrated to construct a graph, where nodes represent contigs and edges represent the similarities between them. Finally, the Louvain algorithm is applied to the graph to cluster closely related contigs. CoCoBin was compared against several state-of-the-art binning tools: BusyBee Web, CONCOCT, MaxBin 2.0, MetaBAT 2, and MetaDecoder on nine simulated datasets, five mock community datasets, and one real dataset. The AMBER tool used to evaluate the binning results across all datasets shows that CoCoBin achieved the best performance regarding the number of bins identified, followed by its performance on the F1 score.
Availability
The source code of CoCoBin is available at https://github.com/cucpbioinfo/CoCoBin
Contact
duangdao.w@chula.ac.th
Supplementary information
Supplementary data are available at Bioinformatics online.
Title: CoCoBin: Graph-Based Metagenomic Binning via Composition–Coverage Separation
Description:
Abstract
Motivation
Metagenomic binning is a critical step in metagenomic analysis, aiming to cluster contigs from the same genome into coherent groups.
In contemporary workflows, most binning tools begin with the assembly of shotgun metagenomic sequencing data.
The assembled contigs are then grouped into bins representing individual microbial genomes or species, typically using taxonomy-independent methods.
Although several methods exist, metagenomic binning remains a challenging yet mandatory task, particularly in the context of complex and highly diverse microbial communities.
Results
We propose CoCoBin, a novel metagenomic binning tool explicitly designed for the effective binning of metagenomic contigs.
In this study, we introduced an innovative approach for calculating contig similarity by separating composition and coverage information.
The method begins by (1) assigning contigs into a cluster based on length ranges, (2) calculating contig similarity based on composition features (e.
g.
, k-mer frequencies), and (3) calculating contig difference based on coverage features.
These similarity measures are then integrated to construct a graph, where nodes represent contigs and edges represent the similarities between them.
Finally, the Louvain algorithm is applied to the graph to cluster closely related contigs.
CoCoBin was compared against several state-of-the-art binning tools: BusyBee Web, CONCOCT, MaxBin 2.
0, MetaBAT 2, and MetaDecoder on nine simulated datasets, five mock community datasets, and one real dataset.
The AMBER tool used to evaluate the binning results across all datasets shows that CoCoBin achieved the best performance regarding the number of bins identified, followed by its performance on the F1 score.
Availability
The source code of CoCoBin is available at https://github.
com/cucpbioinfo/CoCoBin
Contact
duangdao.
w@chula.
ac.
th
Supplementary information
Supplementary data are available at Bioinformatics online.
Related Results
GraphK-LR: Enhancing Long-read Metagenomic Binning with Read-overlap Graphs Across Microbial Kingdoms
GraphK-LR: Enhancing Long-read Metagenomic Binning with Read-overlap Graphs Across Microbial Kingdoms
Abstract
Background: Metagenomics, the study of genetic material from environmental samples, relies on binning - the process of grouping DNA sequences from the same organis...
Evaluation of metagenome binning: advances and challenges
Evaluation of metagenome binning: advances and challenges
Abstract
Several recent deep learning methods for metagenome binning claim improvements in the recovery of high-quality metagenome-assembled genomes. These method...
Evaluation of Metagenome Binning: Advances and Challenges
Evaluation of Metagenome Binning: Advances and Challenges
Abstract
Background
Several recent deep learning methods for metagenome binning claim improvements in the recovery of high qual...
BusyBee Web: towards comprehensive and differential composition-based metagenomic binning
BusyBee Web: towards comprehensive and differential composition-based metagenomic binning
Abstract
Despite recent methodology and reference database improvements for taxonomic profiling tools, metagenomic assembly and genomic binning remain important pill...
CAIM: Coverage-based Analysis for Identification of Microbiome
CAIM: Coverage-based Analysis for Identification of Microbiome
ABSTRACT
Accurate taxonomic profiling of microbial taxa in a metagenomic sample is vital to gain insights into microbial ecology. Recent advancements in sequencing ...
Graph convolutional neural networks for 3D data analysis
Graph convolutional neural networks for 3D data analysis
(English) Deep Learning allows the extraction of complex features directly from raw input data, eliminating the need for hand-crafted features from the classical Machine Learning p...
Effect of data binning and frame averaging for micro-CT image acquisition on the morphometric outcome of bone repair assessment
Effect of data binning and frame averaging for micro-CT image acquisition on the morphometric outcome of bone repair assessment
AbstractDespite the current advances in micro-CT analysis, the influence of some image acquisition parameters on the morphometric assessment outcome have not been fully elucidated....
Metagenomic binning with assembly graph embeddings
Metagenomic binning with assembly graph embeddings
Abstract
Despite recent advancements in sequencing technologies and assembly methods, obtaining high-quality microbial genomes from metagenomic samples is still not...

