COBRA improves the quality of viral genomes assembled from metagenomes
Abstract: Microbial and viral diversity, distribution, and ecological impacts are often studied using metagenome-assembled sequences, but genome incompleteness hampers comprehensive and accurate analyses. Here we introduce COBRA (Contig Overlap Based Re-Assembly), a tool that resolves de Bruijn graph-based assembly breakpoints and joins contigs. Although COBRA is applicable to any DNA sequences assembled from short reads, we benchmarked it using a dataset of published complete viral genomes from the ocean. COBRA accurately joined contigs assembled by metaSPAdes, IDBA_UD, and MEGAHIT, outcompeting several existing binning tools and achieving significantly higher genome accuracy (96.6% vs 19.8-59.6%). We applied COBRA to viral contigs that we assembled from 231 published freshwater metagenomes and obtained 7,334 high-quality or complete species-level genomes (clusters with 95% average nucleotide identity) for viruses of bacteria (phages), ∼83% of which represent new phage species. Notably, ∼70% of the 7,334 species genomes were circular, compared to 34% before COBRA analyses. We expanded genomic sampling of ≥200 kbp phages (i.e., huge phages), the largest of which was curated to completion (717 kbp in length). The improved phage genomes from Rotsee Lake provided context for metatranscriptomic data and indicated in situ activity of huge phages and of WhiB- and cysC/cysH-encoding phages from this site. In conclusion, COBRA improves the assembly contiguity and completeness of microbial and viral genomes and thus the accuracy and reliability of analyses of gene content, diversity, and evolution.
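The abstract does not spell out how contigs are joined, so the following is only a toy illustration of the general idea of overlap-based joining, not COBRA's actual implementation. It assumes that contigs broken at a de Bruijn graph breakpoint typically share an exact end sequence of a known length, and that a contig whose two ends match may represent a circular genome. The function names and the `overlap_len` parameter are hypothetical; reverse complements, read coverage, and paired-end support are deliberately ignored.

```python
from typing import Optional


def join_by_end_overlap(left: str, right: str, overlap_len: int) -> Optional[str]:
    """Join two contigs if the end of `left` exactly matches the start of `right`.

    Hypothetical helper for illustration only: returns the merged sequence
    with the shared overlap kept once, or None if the ends do not match.
    """
    if overlap_len <= 0 or len(left) < overlap_len or len(right) < overlap_len:
        return None
    if left[-overlap_len:] != right[:overlap_len]:
        return None
    # Keep the overlap once: all of `left`, then `right` without its prefix.
    return left + right[overlap_len:]


def looks_circular(contig: str, overlap_len: int) -> bool:
    """Return True if the first and last `overlap_len` bases are identical.

    Assumption for this sketch: a matching start and end hints at a
    circular sequence. Real tools apply further checks.
    """
    if overlap_len <= 0 or len(contig) < 2 * overlap_len:
        return False
    return contig[:overlap_len] == contig[-overlap_len:]


if __name__ == "__main__":
    merged = join_by_end_overlap("ACGTACGGA", "CGGATTTC", overlap_len=4)
    print(merged)
    print(looks_circular("GATTCCCCGATT", overlap_len=4))
```

In practice the overlap length would depend on the assembler and its k-mer settings, which is why it is left as an explicit parameter here rather than hard-coded.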
Related Results
Viral Hijacking of Host RNA-Binding Proteins: Implications for Viral Replication and Pathogenesis
In the intricate dance between viruses and host cells, RNA-binding proteins (RBPs) serve as crucial orchestrators of gene expression and cellular processes. We will delve into the ...
Bioinformatics analysis and collection of protein post-translational modification sites in human viruses
Abstract: In viruses, post-translational modifications (PTMs) are essential for their life cycle. Recognizing viral PTMs is very important for better understanding the mechanism of v...
Generating viral metagenomes from the coral holobiont v1
Reef-building corals comprise multipartite symbioses where the cnidarian animal is host to an array of eukaryotic and prokaryotic organisms, and the viruses that infect them. These...
Reconstructing Prehistoric Viral Genomes from Neanderthal Sequencing Data
DNA viruses that produce persistent infections have been proposed as potential causes for the extinction of Neanderthals, and, therefore, the identification of viral genome remnant...
Reconstructing prehistoric viral genomes from Neanderthal sequencing data
Abstract: DNA viruses that produce persistent infections have been proposed as potential causes for the extinction of Neanderthals and therefore, the identification of viral genome r...
Epigenetic Repression of Herpes Simplex Virus Infection by the Nucleosome Remodeler CHD3
ABSTRACT
Upon infection, the genome of herpes simplex virus is rapidly incorporated into nucleosomes displaying histone modifications characteristic of heterochromatic st...
De novo extraction of microbial strains from metagenomes reveals intra-species niche partitioning
Abstract: Background: We introduce DESMAN for De novo Extraction of Strains from MetAgeNomes. Metagenome sequencing generates short reads from throughout the genomes of a microbial commu...
A systematic comparison of eight new plastome sequences from Ipomoea L
Background
Ipomoea is the largest genus in the family Convolvulaceae. The species in this genus have been widely used in many fields, such as agriculture, nutrition, and medicine. ...


