
ATAV: a comprehensive platform for population-scale genomic analyses

Abstract

Background: A common approach for sequencing studies is to perform joint-calling and store the variants of all samples in a single file. If new samples keep being added, or controls are re-used across several studies, the cost and time required to repeat joint-calling for each analysis can become prohibitive.

Results: We present ATAV, an analysis platform for large-scale whole-exome and whole-genome sequencing projects. ATAV stores variant and per-site coverage data for all samples in a centralized database, which ATAV queries efficiently to support diagnostic analyses for trios and singletons, as well as rare-variant collapsing analyses for finding disease associations in complex diseases. Runtime logs ensure full reproducibility, and the modularized ATAV framework makes it extensible under continuous development. Besides helping with the identification of disease-causing variants for a range of diseases, ATAV has also enabled the discovery of disease genes by rare-variant collapsing on datasets containing more than 20,000 samples. Analyses to date have been performed on data from more than 110,000 individuals, demonstrating the scalability of the framework. The ATAV data browser (http://atavdb.org/) is a web-based interface that gives users easy access to variant-level data directly from the database. Summary-level data for more than 40,000 samples, representing a mix of cases and controls of diverse ancestries, can be queried by the general public. Users have access to phenotype categories of variant carriers, as well as predicted ancestry, gender, and quality metrics. In contrast to many other platforms, the data browser shows data from newly added samples in real time and therefore evolves rapidly as more samples are sequenced.

Conclusions: Since all code is freely available on GitHub, other groups can readily use ATAV to build their own platform, database, and user interface. In addition, users can query one of the largest variant databases for patients sequenced at a tertiary care center and look up their own genes or variants of interest.
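The abstract does not describe how ATAV implements rare-variant collapsing, so the following is only a minimal, generic sketch of the idea: each sample is collapsed to "carrier" or "non-carrier" per gene, and carrier counts in cases and controls are compared with a two-sided Fisher's exact test. The function names, the input layout (pairs of sample ID and gene, assumed to be already filtered to qualifying rare variants), and the choice of test are assumptions for illustration, not ATAV's API.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact p-value for the 2x2 table [[a, b], [c, d]].

    Rows are cases/controls, columns are carriers/non-carriers.
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x case carriers with fixed margins.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum the probabilities of all tables no more likely than the observed one.
    total = sum(p for p in map(prob, range(lo, hi + 1)) if p <= p_obs * (1 + 1e-9))
    return min(1.0, total)


def collapse_by_gene(qualifying_variants, cases, controls):
    """Collapse (sample_id, gene) pairs into per-gene carrier counts and p-values.

    A sample counts once per gene, no matter how many qualifying variants it has.
    Returns {gene: (case_carriers, case_non, control_carriers, control_non, p)}.
    """
    carriers = {}
    for sample, gene in qualifying_variants:
        carriers.setdefault(gene, set()).add(sample)

    results = {}
    for gene, samples in carriers.items():
        a = len(samples & cases)      # case carriers
        c = len(samples & controls)   # control carriers
        b = len(cases) - a            # case non-carriers
        d = len(controls) - c         # control non-carriers
        results[gene] = (a, b, c, d, fisher_exact_two_sided(a, b, c, d))
    return results


if __name__ == "__main__":
    # Hypothetical toy cohort: 4 cases, 4 controls.
    cases = {"c1", "c2", "c3", "c4"}
    controls = {"k1", "k2", "k3", "k4"}
    variants = [("c1", "G1"), ("c1", "G1"), ("c2", "G1"), ("c3", "G1"),
                ("c4", "G1"), ("c1", "G2"), ("k1", "G2")]
    for gene, row in sorted(collapse_by_gene(variants, cases, controls).items()):
        print(gene, row)
```

In a real study the qualifying-variant filter (allele frequency, predicted effect, coverage) and multiple-testing correction across genes would matter far more than the test itself; both are left out of this sketch.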

