
GWAS significance thresholds in large cohorts

Abstract: While the p-value threshold of 5.0 × 10⁻⁸ remains the standard for genome-wide association studies (GWAS) in humans and other species, it needs to be updated for the current era of large-scale GWAS, in which samples of tens of thousands of individuals are used to discover genetic associations at loci with smaller minor allele frequencies. In this study, we used a dataset of 348,501 individuals of European ancestry from the UK Biobank to determine the GWAS thresholds required for multiple-testing correction when considering rare and common variants in additive and dominant GWAS models. Additionally, we employed conditional and joint (COJO) analysis to quantify the proportion of false significant hits in the GWAS results for 72 traits in the UK Biobank when applying the traditional GWAS cut-off versus our newly proposed p-value thresholds. Overall, the results indicate that the conventional GWAS significance threshold of 5.0 × 10⁻⁸ yields a false positive rate of between 20% and 30% in GWAS that use large sample sizes and less common variants. Instead, a more stringent p-value threshold of 5.0 × 10⁻⁹ is needed when rare variants (with minor allele frequency > 0.1%) are included in the association test for both additive and dominance models within the European ancestry population.
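As a rough illustration of the multiple-testing arithmetic behind thresholds of this kind, a Bonferroni-style cut-off divides the family-wise error rate by the number of effectively independent tests. This is not the procedure used in the study, which the abstract does not describe. The sketch assumes a family-wise error rate of 0.05 and hypothetical test counts of one million and ten million, chosen because they reproduce 5.0 × 10⁻⁸ and 5.0 × 10⁻⁹; the function names are illustrative only.

```python
def bonferroni_threshold(alpha: float, n_independent_tests: float) -> float:
    """Per-test p-value cut-off that keeps the family-wise error rate at alpha."""
    if n_independent_tests <= 0:
        raise ValueError("n_independent_tests must be positive")
    return alpha / n_independent_tests


def implied_independent_tests(alpha: float, threshold: float) -> float:
    """Number of independent tests that a given per-test threshold corresponds to."""
    if threshold <= 0:
        raise ValueError("threshold must be positive")
    return alpha / threshold


# Assumed values for illustration; the study's own estimates are not given in the abstract.
ALPHA = 0.05
conventional = bonferroni_threshold(ALPHA, 1_000_000)   # about 5.0e-08
stricter = bonferroni_threshold(ALPHA, 10_000_000)      # about 5.0e-09

print(f"conventional cut-off: {conventional:.1e}")
print(f"stricter cut-off:     {stricter:.1e}")
print(f"tests implied by 5e-9: {implied_independent_tests(ALPHA, 5e-9):.0f}")
```

Under these assumptions, tightening the threshold tenfold is equivalent to correcting for roughly ten times as many independent tests, which is the direction one would expect when rarer variants are added to the analysis.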

Related Results

Valid inference for machine learning-assisted GWAS
Abstract: Machine learning (ML) has revolutionized analytical strategies in almost all scientific disciplines including human genetics and genomics. Due to challenges in sample colle...
Causality between cholelithiasis and ileus: a two-sample Mendelian randomization study
Abstract Background: Cholelithiasis is a prevalent digestive ailment in China, prompting extensive research on its association with ileus. However, prior investigations rel...
Abstract ML-1: Pharmacogenomics in the Quest for Precision Endocrine Therapy of Breast Cancer
Abstract Endocrine therapy, with SERMs and AIs, is the most important treatment modality for the 70% of patients with ER+ early breast cancer. Clinically, there is m...
Processing genome-wide association studies within a repository of heterogeneous genomic datasets
Abstract Background: Genome-wide association studies (GWAS) are based on the observation of genome-wide sets of genetic variants – typically single-n...
Identification and characterization of genes involved in antioxidant traits in local Thai rice (Oryza sativa L.)
Developing rice (Oryza sativa L.) cultivars with high antioxidant activities has become increasingly important since they have nutritional advantages for human health. Hence, the ...
Development of absolute thresholds in chickens
Absolute auditory thresholds were estimated in chickens at 0 and 4 days after hatching. Momentary suppressions of the chicks’ regular peeping, following the onset of a tone, were u...
Clinical and Molecular Characteristics of NPM1MT De Novo AML (NPM1MT dnAML) Differ from NPM1MT therapy-associated AML (NPM1MT tAML)
Background: NPM1-mutated AML accounts for 30% of all adult AML cases and frequently carries a favorable prognostic impact when enriched by a normal karyotype and the absence of FLT...
