Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

How to “appreci8” Variant Calling in NGS Data

View through CrossRef
Background: Calling SNVs and indels in next-generation sequencing (NGS) data is a long-discussed topic. Numerous variant calling approaches are currently available. Despite considerable advances in recent years, detecting low-frequency variants with equally high sensitivity and positive predictive value (PPV) is still challenging. However, for the application of NGS in clinical routine it is essential to rely on valid variant calling results – for high- as well as low-frequency variants. Methods and results: In close collaboration with medical and biological experts, we developed a variant calling approach called appreci8 - A Pipeline for PREcise variant Calling Integrating 8 tools. Our pipeline combines and filters the output of 8 previously evaluated open-source variant calling tools, using a novel artifact- and polymorphism score. The algorithm aims at reproducing a biologist's manual work when investigating a raw list of calls. Detailed analysis of 7 well-characterized targeted sequencing data sets (678 samples) shows that appreci8 succeeds in calling variants with allelic frequencies at 1% with high sensitivity (0.93 to 1.00) and high PPV (0.65 to 1.00). Appreci8 is able to outperform every individual tool as well as alternative approaches for combining the 8 tools (best alternative: GATK with sensitivity=0.82-0.95 and PPV=0.31-0.94). Extending the idea of appreci8, we also developed the appreci8R, which is an R-version of our algorithm. The appreci8R provides an additional shiny GUI and checkpoints for re-starting the analysis from intermediate results. Furthermore, all analysis parameters, including the artifact- and polymorphism score, are user-definable. Even the combination and number of input tools can be defined without restriction. Comparing appreci8 and the appreci8R we observe highly comparable performance of both approaches. Appreci8 is available via Docker (https://hub.docker.com/r/wwuimi/appreci8/), the appreci8R is available via Bioconductor (http://www.bioconductor.org/packages/release/ bioc/html/appreci8R.html). Discussion: Appreci8 and the appreci8R provide a novel approach for accurate calling of high- as well as low-frequency variants in NGS data.
Title: How to “appreci8” Variant Calling in NGS Data
Description:
Background: Calling SNVs and indels in next-generation sequencing (NGS) data is a long-discussed topic.
Numerous variant calling approaches are currently available.
Despite considerable advances in recent years, detecting low-frequency variants with equally high sensitivity and positive predictive value (PPV) is still challenging.
However, for the application of NGS in clinical routine it is essential to rely on valid variant calling results – for high- as well as low-frequency variants.
Methods and results: In close collaboration with medical and biological experts, we developed a variant calling approach called appreci8 - A Pipeline for PREcise variant Calling Integrating 8 tools.
Our pipeline combines and filters the output of 8 previously evaluated open-source variant calling tools, using a novel artifact- and polymorphism score.
The algorithm aims at reproducing a biologist's manual work when investigating a raw list of calls.
Detailed analysis of 7 well-characterized targeted sequencing data sets (678 samples) shows that appreci8 succeeds in calling variants with allelic frequencies at 1% with high sensitivity (0.
93 to 1.
00) and high PPV (0.
65 to 1.
00).
Appreci8 is able to outperform every individual tool as well as alternative approaches for combining the 8 tools (best alternative: GATK with sensitivity=0.
82-0.
95 and PPV=0.
31-0.
94).
Extending the idea of appreci8, we also developed the appreci8R, which is an R-version of our algorithm.
The appreci8R provides an additional shiny GUI and checkpoints for re-starting the analysis from intermediate results.
Furthermore, all analysis parameters, including the artifact- and polymorphism score, are user-definable.
Even the combination and number of input tools can be defined without restriction.
Comparing appreci8 and the appreci8R we observe highly comparable performance of both approaches.
Appreci8 is available via Docker (https://hub.
docker.
com/r/wwuimi/appreci8/), the appreci8R is available via Bioconductor (http://www.
bioconductor.
org/packages/release/ bioc/html/appreci8R.
html).
Discussion: Appreci8 and the appreci8R provide a novel approach for accurate calling of high- as well as low-frequency variants in NGS data.

Related Results

Comparison of Three Assays for Identification of IDH Mutations in AML
Comparison of Three Assays for Identification of IDH Mutations in AML
Introduction Isocitrate dehydrogenase (IDH) mutations are present in up to 20% of acute myeloid leukemia (AML) patients and lead to production of 2-hydroxyglutarate ...
Next-generation sequencing with emphasis on Illumina and Ion torrent platforms.
Next-generation sequencing with emphasis on Illumina and Ion torrent platforms.
Abstract Background: Next-generation sequencing is a type of deep sequencing. In comparison to the previously used Sanger's method, ...
Actionable insights: Liquid NGS in community oncology practice.
Actionable insights: Liquid NGS in community oncology practice.
e15081 Background: Liquid-based next-generation sequencing (NGS) is an integral test alongside tissue biopsy for identifying actionabl...
The impact of perceived calling on work outcomes in a nursing context: The role of career commitment and living one’s calling
The impact of perceived calling on work outcomes in a nursing context: The role of career commitment and living one’s calling
AbstractThe current study examined the impact of perceived calling on nurses’ organizational commitment, organizational citizenship behavior, workplace deviant behavior, and turnov...
Negligible effects of read trimming on the accuracy of germline short variant calling in the human genome
Negligible effects of read trimming on the accuracy of germline short variant calling in the human genome
Background Next generation sequencing (NGS) has become a standard tool in the molecular diagnostics of Mendelian disease, and the precision of such diagnostics is greatly affected ...
Small Subclones Harboring NOTCH1, SF3B1 or BIRC3 Mutations Are Clinically Irrelevant in Chronic Lymphocytic Leukemia
Small Subclones Harboring NOTCH1, SF3B1 or BIRC3 Mutations Are Clinically Irrelevant in Chronic Lymphocytic Leukemia
Abstract Introduction. Ultra-deep next generation sequencing (NGS) allows sensitive detection of mutations and estimation of their clonal abundance in tumor cell pop...
Sleep Habits and Occurrence of Lowback Pain among Craftsmen
Sleep Habits and Occurrence of Lowback Pain among Craftsmen
<span style="color: #000000; font-family: Verdana, Arial, Helvetica, sans-serif; font-size: 10px; font-style: normal; font-variant-ligatures: normal; font-variant-caps: normal; ...
Sleep Habits and Occurrence of Lowback Pain among Craftsmen
Sleep Habits and Occurrence of Lowback Pain among Craftsmen
<span style="color: #000000; font-family: Verdana, Arial, Helvetica, sans-serif; font-size: 10px; font-style: normal; font-variant-ligatures: normal; font-variant-caps: normal; ...

Back to Top