Javascript must be enabled to continue!
Cont-ID: Detection of samples cross-contamination in viral metagenomic data
View through CrossRef
BackgroundHigh Throughput sequencing (HTS) technologies completed by the bioinformatic analysis of the generated data are becoming an important detection technique for virus diagnostics. They have the potential to replace or complement the current PCR-based methods thanks to their improved inclusivity and analytical sensitivity, as well as their overall good repeatability and reproducibility. Cross-contamination is a well-known phenomenon in molecular diagnostics and corresponds to the exchange of genetic material between samples. Cross-contamination management was a key drawback during the development of PCR-based detection and is now adequately monitored in routine diagnostics. HTS technologies are facing similar difficulties due to their very high analytical sensitivity. As a single viral read could be detected in millions of sequencing reads, it is mandatory to fix a detection threshold that will be influenced by cross-contamination. Cross-contamination monitoring should therefore be a priority when detecting viruses by HTS technologies.ResultsWe present Cont-ID, a bioinformatic tool designed to check for cross-contamination by analysing the relative abundance of virus sequencing reads identified in sequence metagenomic datasets and their duplication between samples. It can be applied when the samples in a sequencing batch have been processed in parallel in the laboratory and with at least one external alien control. Using 273 real datasets, including 68 virus species from different hosts (fruit tree, plant, human) and several library preparation protocols (Ribodepleted total RNA, small RNA and double stranded RNA), we demonstrated that Cont-ID classifies with high accuracy (91%) viral species detection into (true) infection or (cross) contamination. This classification raises confidence in the detection and facilitates the downstream interpretation and confirmation of the results by prioritising the virus detections that should be confirmed.ConclusionsCross-contamination between samples when detecting viruses using HTS can be monitored and highlighted by Cont-ID (provided an alien control is present). Cont-ID is based on a flexible methodology relying on the output of bioinformatics analyses of the sequencing reads and considering the contamination pattern specific to each batch of samples. The Cont-ID method is adaptable so that each laboratory can optimise it before its validation and routine use.
Title: Cont-ID: Detection of samples cross-contamination in viral metagenomic data
Description:
BackgroundHigh Throughput sequencing (HTS) technologies completed by the bioinformatic analysis of the generated data are becoming an important detection technique for virus diagnostics.
They have the potential to replace or complement the current PCR-based methods thanks to their improved inclusivity and analytical sensitivity, as well as their overall good repeatability and reproducibility.
Cross-contamination is a well-known phenomenon in molecular diagnostics and corresponds to the exchange of genetic material between samples.
Cross-contamination management was a key drawback during the development of PCR-based detection and is now adequately monitored in routine diagnostics.
HTS technologies are facing similar difficulties due to their very high analytical sensitivity.
As a single viral read could be detected in millions of sequencing reads, it is mandatory to fix a detection threshold that will be influenced by cross-contamination.
Cross-contamination monitoring should therefore be a priority when detecting viruses by HTS technologies.
ResultsWe present Cont-ID, a bioinformatic tool designed to check for cross-contamination by analysing the relative abundance of virus sequencing reads identified in sequence metagenomic datasets and their duplication between samples.
It can be applied when the samples in a sequencing batch have been processed in parallel in the laboratory and with at least one external alien control.
Using 273 real datasets, including 68 virus species from different hosts (fruit tree, plant, human) and several library preparation protocols (Ribodepleted total RNA, small RNA and double stranded RNA), we demonstrated that Cont-ID classifies with high accuracy (91%) viral species detection into (true) infection or (cross) contamination.
This classification raises confidence in the detection and facilitates the downstream interpretation and confirmation of the results by prioritising the virus detections that should be confirmed.
ConclusionsCross-contamination between samples when detecting viruses using HTS can be monitored and highlighted by Cont-ID (provided an alien control is present).
Cont-ID is based on a flexible methodology relying on the output of bioinformatics analyses of the sequencing reads and considering the contamination pattern specific to each batch of samples.
The Cont-ID method is adaptable so that each laboratory can optimise it before its validation and routine use.
Related Results
Echinococcus granulosus in Environmental Samples: A Cross-Sectional Molecular Study
Echinococcus granulosus in Environmental Samples: A Cross-Sectional Molecular Study
Abstract
Introduction
Echinococcosis, caused by tapeworms of the Echinococcus genus, remains a significant zoonotic disease globally. The disease is particularly prevalent in areas...
CAIM: Coverage-based Analysis for Identification of Microbiome
CAIM: Coverage-based Analysis for Identification of Microbiome
ABSTRACTAccurate taxonomic profiling of microbial taxa in a metagenomic sample is vital to gain insights into microbial ecology. Recent advancements in sequencing technologies have...
Metagenomic Thermometer
Metagenomic Thermometer
AbstractVarious microorganisms exist in environments, and each of which has an optimal growth temperature (OGT). The relationship between genomic information and OGT of each specie...
543-P: A Randomised Control Trial to Evaluate the Efficacy of Chinese Traditional Medicine for the Treatment of Diabetic Peripheral Neuropathy
543-P: A Randomised Control Trial to Evaluate the Efficacy of Chinese Traditional Medicine for the Treatment of Diabetic Peripheral Neuropathy
Background: Diabetic peripheral neuropathy (DPN) is a common complication in patients (pts) with diabetes. However, multinational data do not support any convincing efficacy of the...
Viral Hijacking of Host RNA-Binding Proteins: Implications for Viral Replication and Pathogenesis
Viral Hijacking of Host RNA-Binding Proteins: Implications for Viral Replication and Pathogenesis
In the intricate dance between viruses and host cells, RNA-binding proteins (RBPs) serve as crucial orchestrators of gene expression and cellular processes. We will delve into the ...
Metagenomic Thermometer
Metagenomic Thermometer
Abstract
Various microorganisms exist in environments, and each of them has its optimal growth temperature (OGT). The relationship between genomic information and OG...
Bioinformatics analysis and collection of protein post-translational modification sites in human viruses
Bioinformatics analysis and collection of protein post-translational modification sites in human viruses
AbstractIn viruses, post-translational modifications (PTMs) are essential for their life cycle. Recognizing viral PTMs is very important for better understanding the mechanism of v...
LMAS: evaluating metagenomic short de novo assembly methods through defined communities
LMAS: evaluating metagenomic short de novo assembly methods through defined communities
Abstract
Background
The de novo assembly of raw sequence data is key in metagenomic analysis. It allows recovering draft genomes...

