Javascript must be enabled to continue!
CoCoBin: Graph-Based Metagenomic Binning via Composition–Coverage Separation
View through CrossRef
Abstract
Motivation
Metagenomic binning is a critical step in metagenomic analysis, aiming to cluster contigs from the same genome into coherent groups. In contemporary workflows, most binning tools begin with the assembly of shotgun metagenomic sequencing data. The assembled contigs are then grouped into bins representing individual microbial genomes or species, typically using taxonomy-independent methods. Although several methods exist, metagenomic binning remains a challenging yet mandatory task, particularly in the context of complex and highly diverse microbial communities.
Results
We propose CoCoBin, a novel metagenomic binning tool explicitly designed for the effective binning of metagenomic contigs. In this study, we introduced an innovative approach for calculating contig similarity by separating composition and coverage information. The method begins by (1) assigning contigs into a cluster based on length ranges, (2) calculating contig similarity based on composition features (e.g., k-mer frequencies), and (3) calculating contig difference based on coverage features. These similarity measures are then integrated to construct a graph, where nodes represent contigs and edges represent the similarities between them. Finally, the Louvain algorithm is applied to the graph to cluster closely related contigs. CoCoBin was compared against several state-of-the-art binning tools: BusyBee Web, CONCOCT, MaxBin 2.0, MetaBAT 2, and MetaDecoder on nine simulated datasets, five mock community datasets, and one real dataset. The AMBER tool used to evaluate the binning results across all datasets shows that CoCoBin achieved the best performance regarding the number of bins identified, followed by its performance on the F1 score.
Availability
The source code of CoCoBin is available at https://github.com/cucpbioinfo/CoCoBin
Contact
duangdao.w@chula.ac.th
Supplementary information
Supplementary data are available at Bioinformatics online.
Title: CoCoBin: Graph-Based Metagenomic Binning via Composition–Coverage Separation
Description:
Abstract
Motivation
Metagenomic binning is a critical step in metagenomic analysis, aiming to cluster contigs from the same genome into coherent groups.
In contemporary workflows, most binning tools begin with the assembly of shotgun metagenomic sequencing data.
The assembled contigs are then grouped into bins representing individual microbial genomes or species, typically using taxonomy-independent methods.
Although several methods exist, metagenomic binning remains a challenging yet mandatory task, particularly in the context of complex and highly diverse microbial communities.
Results
We propose CoCoBin, a novel metagenomic binning tool explicitly designed for the effective binning of metagenomic contigs.
In this study, we introduced an innovative approach for calculating contig similarity by separating composition and coverage information.
The method begins by (1) assigning contigs into a cluster based on length ranges, (2) calculating contig similarity based on composition features (e.
g.
, k-mer frequencies), and (3) calculating contig difference based on coverage features.
These similarity measures are then integrated to construct a graph, where nodes represent contigs and edges represent the similarities between them.
Finally, the Louvain algorithm is applied to the graph to cluster closely related contigs.
CoCoBin was compared against several state-of-the-art binning tools: BusyBee Web, CONCOCT, MaxBin 2.
0, MetaBAT 2, and MetaDecoder on nine simulated datasets, five mock community datasets, and one real dataset.
The AMBER tool used to evaluate the binning results across all datasets shows that CoCoBin achieved the best performance regarding the number of bins identified, followed by its performance on the F1 score.
Availability
The source code of CoCoBin is available at https://github.
com/cucpbioinfo/CoCoBin
Contact
duangdao.
w@chula.
ac.
th
Supplementary information
Supplementary data are available at Bioinformatics online.
Related Results
GraphK-LR: Enhancing Long-read Metagenomic Binning with Read-overlap Graphs Across Microbial Kingdoms
GraphK-LR: Enhancing Long-read Metagenomic Binning with Read-overlap Graphs Across Microbial Kingdoms
Abstract
Background: Metagenomics, the study of genetic material from environmental samples, relies on binning - the process of grouping DNA sequences from the same organis...
CAIM: Coverage-based Analysis for Identification of Microbiome
CAIM: Coverage-based Analysis for Identification of Microbiome
ABSTRACTAccurate taxonomic profiling of microbial taxa in a metagenomic sample is vital to gain insights into microbial ecology. Recent advancements in sequencing technologies have...
BusyBee Web: towards comprehensive and differential composition-based metagenomic binning
BusyBee Web: towards comprehensive and differential composition-based metagenomic binning
Abstract
Despite recent methodology and reference database improvements for taxonomic profiling tools, metagenomic assembly and genomic binning remain important pill...
Effect of data binning and frame averaging for micro-CT image acquisition on the morphometric outcome of bone repair assessment
Effect of data binning and frame averaging for micro-CT image acquisition on the morphometric outcome of bone repair assessment
AbstractDespite the current advances in micro-CT analysis, the influence of some image acquisition parameters on the morphometric assessment outcome have not been fully elucidated....
Metagenomic Thermometer
Metagenomic Thermometer
AbstractVarious microorganisms exist in environments, and each of which has an optimal growth temperature (OGT). The relationship between genomic information and OGT of each specie...
Pixel Binning in Digital Radiography Imaging
Pixel Binning in Digital Radiography Imaging
In digital radiography imaging, pixel binning is an effective way to reduce the amount of image data for transmission or storage, and is particularly effective for application to d...
Mapping the effective coverage of modern contraceptive services in Ethiopia
Mapping the effective coverage of modern contraceptive services in Ethiopia
Introduction
Modern contraceptive services are vital for reducing maternal and infant morbidity and mortality. However, in Ethiopia, the effective coverage (quality-adjusted covera...
Metagenomic Thermometer
Metagenomic Thermometer
Abstract
Various microorganisms exist in environments, and each of them has its optimal growth temperature (OGT). The relationship between genomic information and OG...

