Valid inference for machine learning-assisted GWAS
Abstract
Machine learning (ML) has revolutionized analytical strategies in almost all scientific disciplines, including human genetics and genomics. Because sample collection and precise phenotyping are challenging, the ML-assisted genome-wide association study (GWAS), which uses sophisticated ML to impute phenotypes and then performs GWAS on the imputed outcomes, has quickly gained popularity in complex trait genetics research. However, the validity of associations identified by ML-assisted GWAS has not been carefully evaluated. In this study, we report pervasive risks of false positive associations in ML-assisted GWAS and introduce POP-GWAS, a novel statistical framework that reimagines GWAS on ML-imputed outcomes. POP-GWAS provides valid statistical inference irrespective of the quality of imputation or the variables and algorithms used for imputation. It also requires only GWAS summary statistics as input. We employed POP-GWAS to perform the largest GWAS of bone mineral density (BMD) derived from dual-energy X-ray absorptiometry imaging at 14 skeletal sites, identifying 89 novel loci reaching genome-wide significance and revealing skeletal site-specific genetic architecture of BMD. Our framework may fundamentally reshape the analytical strategies in future ML-assisted GWAS.
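The abstract does not spell out the estimator, so the following is only a minimal sketch of how a summary-statistics-only correction of this general kind could look for a single variant. It is not the published POP-GWAS method. It assumes three per-variant GWAS results are available: the observed phenotype in the labeled sample, the imputed phenotype in the same labeled sample, and the imputed phenotype in an independent unlabeled sample. It also assumes that the correlation `r` between observed and imputed phenotypes is known, and that the covariance of the two labeled-sample estimates is approximately `r` times the product of their standard errors. All names (`SnpStat`, `combine_summary_stats`) are hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass
class SnpStat:
    """Per-variant GWAS summary statistic (hypothetical container)."""
    beta: float  # estimated effect size
    se: float    # standard error of the estimate


def combine_summary_stats(y_lab: SnpStat, yhat_lab: SnpStat,
                          yhat_unlab: SnpStat, r: float):
    """Combine three GWAS estimates for one variant.

    y_lab:      GWAS of the observed phenotype, labeled sample
    yhat_lab:   GWAS of the imputed phenotype, same labeled sample
    yhat_unlab: GWAS of the imputed phenotype, independent unlabeled sample
    r:          assumed correlation between observed and imputed phenotype
    """
    # Assumption: covariance of the two labeled-sample estimates.
    cov = r * y_lab.se * yhat_lab.se
    # Variance of the difference between the two imputed-phenotype
    # estimates, which come from independent samples.
    var_diff = yhat_lab.se ** 2 + yhat_unlab.se ** 2
    # Weight that minimizes the variance of the combined estimate.
    w = cov / var_diff
    # The difference term has mean zero whenever both imputed-phenotype
    # GWAS target the same quantity, so imputation error does not bias beta.
    beta = y_lab.beta - w * (yhat_lab.beta - yhat_unlab.beta)
    # Variance at the optimal weight; never exceeds y_lab.se ** 2.
    var = y_lab.se ** 2 - cov ** 2 / var_diff
    se = math.sqrt(var)
    # Two-sided p-value from a normal approximation.
    p = math.erfc(abs(beta / se) / math.sqrt(2))
    return beta, se, p
```

Under these assumptions, an uninformative imputation (`r = 0`) gives a weight of zero and the result reduces to the labeled-sample GWAS, while a better imputation shrinks the standard error. That mirrors the abstract's claim that validity does not depend on imputation quality.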

