Javascript must be enabled to continue!
Evaluation of Metagenome Binning: Advances and Challenges
View through CrossRef
Abstract
Background
Several recent deep learning methods for metagenome binning claim improvements in the recovery of high quality metagenome-assembled genomes. These methods differ in their approaches to learn the contig embeddings and to cluster them. Rapid advances in binning require rigorous benchmarking to evaluate the effectiveness of new methods. We have benchmarked newly developed state-of-the-art deep learning binners on CAMI2 datasets, including our own, McDevol.
Results
The results show that COMEBin and GenomeFace give the best binning accuracy, although not always the best embedding accuracy. Interestingly, post-binning reassembly consistently improves the quality of low coverage bins. We find that binning coassembled contigs with multi-sample coverage is effective for low coverage dataset while binning multi-sample contigs with multi-sample coverage (‘multi-sample’) is effective for high-coverage samples. In multi-sample binning, splitting the embedding space by sample before clustering showed enhanced performance compared to the standard approach of splitting final clusters by sample.
Conclusions
COMEBin and GenomeFace emerged as the top-performing tools overall, with MetaBAT2 and GenomeFace demonstrating superior speed. To facilitate future development, we provide workflows for standardized benchmarking of metagenome binners.
Title: Evaluation of Metagenome Binning: Advances and Challenges
Description:
Abstract
Background
Several recent deep learning methods for metagenome binning claim improvements in the recovery of high quality metagenome-assembled genomes.
These methods differ in their approaches to learn the contig embeddings and to cluster them.
Rapid advances in binning require rigorous benchmarking to evaluate the effectiveness of new methods.
We have benchmarked newly developed state-of-the-art deep learning binners on CAMI2 datasets, including our own, McDevol.
Results
The results show that COMEBin and GenomeFace give the best binning accuracy, although not always the best embedding accuracy.
Interestingly, post-binning reassembly consistently improves the quality of low coverage bins.
We find that binning coassembled contigs with multi-sample coverage is effective for low coverage dataset while binning multi-sample contigs with multi-sample coverage (‘multi-sample’) is effective for high-coverage samples.
In multi-sample binning, splitting the embedding space by sample before clustering showed enhanced performance compared to the standard approach of splitting final clusters by sample.
Conclusions
COMEBin and GenomeFace emerged as the top-performing tools overall, with MetaBAT2 and GenomeFace demonstrating superior speed.
To facilitate future development, we provide workflows for standardized benchmarking of metagenome binners.
Related Results
MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies
MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies
We previously reported MetaBAT, an automated metagenome binning software tool to reconstruct single genomes from microbial communities for subsequent analyses of uncultivated micro...
GraphK-LR: Enhancing Long-read Metagenomic Binning with Read-overlap Graphs Across Microbial Kingdoms
GraphK-LR: Enhancing Long-read Metagenomic Binning with Read-overlap Graphs Across Microbial Kingdoms
Abstract
Background: Metagenomics, the study of genetic material from environmental samples, relies on binning - the process of grouping DNA sequences from the same organis...
Evaluation of metagenome binning: advances and challenges
Evaluation of metagenome binning: advances and challenges
Abstract
Several recent deep learning methods for metagenome binning claim improvements in the recovery of high-quality metagenome-assembled genomes. These method...
Effect of data binning and frame averaging for micro-CT image acquisition on the morphometric outcome of bone repair assessment
Effect of data binning and frame averaging for micro-CT image acquisition on the morphometric outcome of bone repair assessment
AbstractDespite the current advances in micro-CT analysis, the influence of some image acquisition parameters on the morphometric assessment outcome have not been fully elucidated....
CoCoBin: Graph-Based Metagenomic Binning via Composition–Coverage Separation
CoCoBin: Graph-Based Metagenomic Binning via Composition–Coverage Separation
Abstract
Motivation
Metagenomic binning is a critical step in metagenomic analysis, aiming to cluster contigs from the same genome into c...
Pixel Binning in Digital Radiography Imaging
Pixel Binning in Digital Radiography Imaging
In digital radiography imaging, pixel binning is an effective way to reduce the amount of image data for transmission or storage, and is particularly effective for application to d...
binny: an automated binning algorithm to recover high-quality genomes from complex metagenomic datasets
binny: an automated binning algorithm to recover high-quality genomes from complex metagenomic datasets
Abstract
The reconstruction of genomes is a critical step in genome-resolved metagenomics and for multi-omic data integration from microbial comm...
Non-Recommended Publishing Lists: Strategies for Detecting Deceitful Journals
Non-Recommended Publishing Lists: Strategies for Detecting Deceitful Journals
Abstract
The rapid growth of open access publishing (OAP) has significantly improved the accessibility and dissemination of scientific knowledge. However, this expansion has also c...

