Javascript must be enabled to continue!
Oligogenic combinations of rare variants influence specific phenotypes in complex disorders
View through CrossRef
ABSTRACT
Genetic studies of complex disorders such as autism and intellectual disability (ID) are often based on enrichment of individual rare variants or their aggregate burden in affected individuals compared to controls. However, these studies overlook the influence of combinations of rare variants that may not be deleterious on their own due to statistical challenges resulting from rarity and combinatorial explosion when enumerating variant combinations, limiting our ability to study oligogenic basis for these disorders. We present a framework that combines the apriori algorithm and statistical inference to identify specific combinations of mutated genes associated with complex phenotypes. Our approach overcomes computational barriers and exhaustively evaluates variant combinations to identify non-additive relationships between simultaneously mutated genes. Using this approach, we analyzed 6,189 individuals with autism and identified 718 combinations significantly associated with ID, and carriers of these combinations showed lower IQ than expected in an independent cohort of 1,878 individuals. These combinations were enriched for nervous system genes such as
NIN
and
NGF
, showed complex inheritance patterns, and were depleted in unaffected siblings. We found that an affected individual can carry many oligogenic combinations, each contributing to the same phenotype or distinct phenotypes at varying effect sizes. We also used this framework to identify combinations associated with multiple comorbid phenotypes, including mutations of
COL28A1
and
MFSD2B
for ID and schizophrenia and
ABCA4, DNAH10
and
MC1R
for ID and anxiety/depression. Our framework identifies a key component of missing heritability and provides a novel paradigm to untangle the genetic architecture of complex disorders.
SIGNIFICANCE
While rare mutations in single genes or their collective burden partially explain the genetic basis for complex disorders, the role of specific combinations of rare variants is not completely understood. This is because combinations of rare variants are rarer and evaluating all possible combinations would result in a combinatorial explosion, creating difficulties for statistical and computational analysis. We developed a data mining approach that overcomes these limitations to precisely quantify the influence of combinations of two or more mutated genes on a specific clinical feature or multiple co-occurring features. Our framework provides a new paradigm for dissecting the genetic causes of complex disorders and provides an impetus for its utility in clinical diagnosis.
Title: Oligogenic combinations of rare variants influence specific phenotypes in complex disorders
Description:
ABSTRACT
Genetic studies of complex disorders such as autism and intellectual disability (ID) are often based on enrichment of individual rare variants or their aggregate burden in affected individuals compared to controls.
However, these studies overlook the influence of combinations of rare variants that may not be deleterious on their own due to statistical challenges resulting from rarity and combinatorial explosion when enumerating variant combinations, limiting our ability to study oligogenic basis for these disorders.
We present a framework that combines the apriori algorithm and statistical inference to identify specific combinations of mutated genes associated with complex phenotypes.
Our approach overcomes computational barriers and exhaustively evaluates variant combinations to identify non-additive relationships between simultaneously mutated genes.
Using this approach, we analyzed 6,189 individuals with autism and identified 718 combinations significantly associated with ID, and carriers of these combinations showed lower IQ than expected in an independent cohort of 1,878 individuals.
These combinations were enriched for nervous system genes such as
NIN
and
NGF
, showed complex inheritance patterns, and were depleted in unaffected siblings.
We found that an affected individual can carry many oligogenic combinations, each contributing to the same phenotype or distinct phenotypes at varying effect sizes.
We also used this framework to identify combinations associated with multiple comorbid phenotypes, including mutations of
COL28A1
and
MFSD2B
for ID and schizophrenia and
ABCA4, DNAH10
and
MC1R
for ID and anxiety/depression.
Our framework identifies a key component of missing heritability and provides a novel paradigm to untangle the genetic architecture of complex disorders.
SIGNIFICANCE
While rare mutations in single genes or their collective burden partially explain the genetic basis for complex disorders, the role of specific combinations of rare variants is not completely understood.
This is because combinations of rare variants are rarer and evaluating all possible combinations would result in a combinatorial explosion, creating difficulties for statistical and computational analysis.
We developed a data mining approach that overcomes these limitations to precisely quantify the influence of combinations of two or more mutated genes on a specific clinical feature or multiple co-occurring features.
Our framework provides a new paradigm for dissecting the genetic causes of complex disorders and provides an impetus for its utility in clinical diagnosis.
Related Results
Frequency of Common Chromosomal Abnormalities in Patients with Idiopathic Acquired Aplastic Anemia
Frequency of Common Chromosomal Abnormalities in Patients with Idiopathic Acquired Aplastic Anemia
Objective: To determine the frequency of common chromosomal aberrations in local population idiopathic determine the frequency of common chromosomal aberrations in local population...
DUVEL: an active-learning annotated biomedical corpus for the recognition of oligogenic combinations
DUVEL: an active-learning annotated biomedical corpus for the recognition of oligogenic combinations
Abstract
While biomedical relation extraction (bioRE) datasets have been instrumental in the development of methods to support biocuration of single variants from te...
Energy Measures as Biomarkers of SARS-CoV-2 Variants and Receptors
Energy Measures as Biomarkers of SARS-CoV-2 Variants and Receptors
The COVID-19 outbreak has made it evident that the nature and behavior of SARS-CoV-2 requires constant research and surveillance, owing to the high mutation rates that lead to vari...
Clinical Implications of Germline Predisposition Gene Variants in Patients with Refractory or Relapsed B Acute Lymphoblastic Leukemia
Clinical Implications of Germline Predisposition Gene Variants in Patients with Refractory or Relapsed B Acute Lymphoblastic Leukemia
Objectives:Gene variants are important factors in prognosis of the patients with hematological malignancies. In current study, our team investigate the relationship between blood a...
Leveraging cancer mutation data to predict the pathogenicity of germline missense variants
Leveraging cancer mutation data to predict the pathogenicity of germline missense variants
ABSTRACTInnovative and easy-to-implement strategies are needed to improve the pathogenicity assessment of rare germline missense variants. Somatic cancer driver mutations identifie...
Next generation sequencing identifies a pattern of novel germline variants in early-onset colorectal cancer
Next generation sequencing identifies a pattern of novel germline variants in early-onset colorectal cancer
Abstract
Early-onset colorectal cancer (EOCRC) incidence is increasing rapidly worldwide. However, the majority of EOCRCs are not substantiated b...
A novel genetic region prioritization tool using a new statistical framework to analyse oligogenic diseases
A novel genetic region prioritization tool using a new statistical framework to analyse oligogenic diseases
Nowadays, few bioinformatics tools are able to identify the genetic cause of oligogenic diseases. To allow such analyses and make the best use of public and in-house available exom...
Testing gene-environment interactions for rare and/or common variants in sequencing association studies
Testing gene-environment interactions for rare and/or common variants in sequencing association studies
Abstract
The risk of many complex diseases is determined by a complex interplay of genetic and environmental factors. Advanced next generation sequencing technology...

