
Negligible effects of read trimming on the accuracy of germline short variant calling in the human genome

Next-generation sequencing (NGS) has become a standard tool in the molecular diagnostics of Mendelian disease, and the precision of such diagnostics depends heavily on the accuracy of variant calling from sequencing data. We recently carried out a comprehensive evaluation of multiple variant calling pipelines, showing that state-of-the-art neural network-based methods achieve the best accuracy of variant discovery in the coding genome. In this work, we systematically evaluated the effects of adapters on the performance of variant calling tools using standard reference Genome-in-a-Bottle (GIAB) samples. We show that adapter trimming has no effect on the accuracy of the best-performing variant callers (e.g., DeepVariant) on whole-genome sequencing (WGS) data. For whole-exome sequencing (WES) datasets, a subtle improvement in accuracy was observed in some of the samples. In high-coverage WES data (~200x mean coverage), adapter removal allowed the discovery of 2-4 additional true positive variants in only two of the seven datasets tested. Moreover, this effect did not depend on the median insert size or on the proportion of adapter sequences in reads. Surprisingly, the effect of trimming on variant calling was reversed when moderate-coverage (~80-100x) WES data were used. Finally, we show that some recently developed machine learning-based variant callers depend more strongly on the presence of adapters in reads. Taken together, our results indicate that adapter removal is unnecessary when calling germline variants, but suggest that preprocessing methods should be chosen carefully when developing and using machine learning-based variant analysis methods.
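To give a sense of scale for a gain of a few true positive variants, the following minimal Python sketch computes precision, recall, and F1 for two call sets compared against a truth set. The `CallCounts` container, the function names, and every count below are hypothetical and chosen only for illustration; they are not results from this study, and this is not necessarily how the authors computed accuracy.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CallCounts:
    """Hypothetical container for counts from a truth-set comparison."""
    tp: int  # true positives: called variants present in the truth set
    fp: int  # false positives: called variants absent from the truth set
    fn: int  # false negatives: truth-set variants that were not called


def precision(c: CallCounts) -> float:
    """Fraction of called variants that are correct."""
    called = c.tp + c.fp
    return c.tp / called if called else 0.0


def recall(c: CallCounts) -> float:
    """Fraction of truth-set variants that were called."""
    truth = c.tp + c.fn
    return c.tp / truth if truth else 0.0


def f1(c: CallCounts) -> float:
    """Harmonic mean of precision and recall."""
    p, r = precision(c), recall(c)
    return 2 * p * r / (p + r) if (p + r) else 0.0


def f1_delta(before: CallCounts, after: CallCounts) -> float:
    """Change in F1 when moving from `before` to `after`."""
    return f1(after) - f1(before)


# Illustrative (made-up) counts: trimming recovers 3 extra true positives,
# which turns 3 false negatives into true positives.
untrimmed = CallCounts(tp=24_000, fp=50, fn=100)
trimmed = CallCounts(tp=24_003, fp=50, fn=97)

print(f"F1 untrimmed: {f1(untrimmed):.6f}")
print(f"F1 trimmed:   {f1(trimmed):.6f}")
print(f"F1 change:    {f1_delta(untrimmed, trimmed):+.6f}")
```

With counts of this order, a handful of additional true positives moves F1 only in the fifth decimal place, which is the sense in which such an effect can be called negligible.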
