
Bacteria are everywhere, even in your COI marker gene data!

Abstract
The mitochondrial cytochrome C oxidase subunit I gene (COI) is commonly used in eDNA metabarcoding studies, especially for assessing metazoan diversity. Yet a great number of the COI operational taxonomic units (OTUs) and/or amplicon sequence variants (ASVs) retrieved from such studies do not get a taxonomic assignment with a reference sequence and are referred to as “dark matter”. For a thorough investigation of this dark matter, we have developed the Dark mAtteR iNvestigator (DARN) software tool. A reference COI-oriented phylogenetic tree was built from 1,240 consensus sequences covering all three domains of life, with more than 80% of those representing eukaryotic taxa. With respect to eukaryotes, consensus sequences at the family level were constructed from 183,330 sequences retrieved from the Midori reference 2 database. Similarly, sequences from 559 bacterial and 41 archaeal genera were retrieved from the BOLD database. DARN makes use of the phylogenetic tree to investigate and quantify pre-processed sequences of amplicon samples and provides both a tabular and a graphical overview of their phylogenetic assignments. To evaluate DARN, both environmental and bulk metabarcoding samples from different aquatic environments, amplified using various primer sets, were analysed. We demonstrate that a large proportion of non-target prokaryotic organisms, such as bacteria and archaea, are also amplified in eDNA samples, and we suggest that bacterial COI sequences be included in the reference databases used for taxonomy assignment to allow for further analyses of dark matter. DARN source code is available on GitHub at https://github.com/hariszaf/darn and as a Docker image at https://hub.docker.com/r/hariszaf/darn.

Author summary
DARN is a software approach aiming to provide further insight into the COI amplicon data coming from environmental samples.
Building a COI-oriented reference phylogenetic tree is a challenging task, especially considering the small number of curated microbial COI sequences deposited in reference databases, e.g. ~4,000 bacterial and ~150 archaeal sequences in BOLD. As more such sequences are collated, the DARN approach will improve. To provide a more interactive way of communicating both our approach and our results, we strongly encourage the reader to visit this Google Colab notebook, where all steps are described one by one, and this GitHub page, where our results are demonstrated. Our approach corroborates the known presence of microbial sequences in COI environmental sequencing samples and highlights the need for curated bacterial and archaeal COI sequences and their integration into reference databases (e.g. Midori, BOLD). We argue that DARN will benefit researchers as a quality control tool for their sequenced samples, both for distinguishing eukaryotic from non-eukaryotic OTUs/ASVs and for understanding the unknown unknowns.
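
To make the idea of a tabular overview of phylogenetic assignments concrete, the following is a minimal sketch and not DARN's actual code. It assumes each OTU/ASV has already been placed on the reference tree and labelled with a domain; the function name `summarise_domains`, the sequence identifiers, and the labels are all hypothetical and used for illustration only.

```python
from collections import Counter


def summarise_domains(assignments):
    """Count sequences per domain and return (count, fraction) per domain.

    `assignments` maps a sequence id to a domain label. Both the ids and
    the labels are hypothetical; they stand in for the result of placing
    each query sequence on a reference tree.
    """
    counts = Counter(assignments.values())
    total = sum(counts.values())
    if total == 0:
        return {}
    return {domain: (n, n / total) for domain, n in counts.items()}


# Illustrative placements only, not results from the study.
placements = {
    "ASV_1": "Eukaryota",
    "ASV_2": "Bacteria",
    "ASV_3": "Bacteria",
    "ASV_4": "Archaea",
    "ASV_5": "Eukaryota",
    "ASV_6": "unassigned",
}

summary = summarise_domains(placements)

# Print a small tab-separated table: domain, count, fraction of all sequences.
for domain, (n, fraction) in sorted(summary.items()):
    print(f"{domain}\t{n}\t{fraction:.2f}")
```

A table of this kind is what would let a user see at a glance how much of a sample falls outside the eukaryotic target group.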
