Javascript must be enabled to continue!
Prioritizing Complex Disease Genes from Heterogeneous Public Databases
View through CrossRef
AbstractBackgroundComplex human diseases are defined not only by sophisticated patterns of genetic variants/mutations upstream but also by many interplaying genes, RNAs, and proteins downstream. Analyzing multiple genomic and functional genomic data types to determine a short list of genes or molecules of interest is a common task called “gene prioritization” in biology. There are many statistical, biological, and bioinformatic methods developed to perform gene prioritization tasks. However, little research has been conducted to examine the relationships among the technique used, merged/separate use of each data modality, the gene list’s network/pathway context, and various gene ranking/expansions.MethodsWe introduce a new analytical framework called “Gene Ranking and Iterative Prioritization based on Pathways” (GRIPP) to prioritize genes derived from different modalities. Multiple data sources, such as CBioPortal, PAGER, and COSMIC were used to compile the initial gene list. We used the PAGER software to expand the gene list based on biological pathways and the BEERE software to construct protein-protein interaction networks that include the gene list to rank order genes. We produced a final gene list for each data modality iteratively from an initial draft gene list, using glioblastoma multiform (GBM) as a case study.ConclusionWe demonstrated that GBM gene lists obtained from three modalities (differential gene expressions, gene mutations, and copy number alterations) and several data sources could be iteratively expanded and ranked using GRIPP. While integrating various modalities of data can be useful to generate an integrated ranked gene list related to any specific disease, the integration may also decrease the overall significance of ranked genes derived from specific data modalities. Therefore, we recommend carefully sorting and integrating gene lists according to each modality, such as gene mutations, epigenetic controls, or differential expressions, to procure modality-specific biological insights into the prioritized genes.
Title: Prioritizing Complex Disease Genes from Heterogeneous Public Databases
Description:
AbstractBackgroundComplex human diseases are defined not only by sophisticated patterns of genetic variants/mutations upstream but also by many interplaying genes, RNAs, and proteins downstream.
Analyzing multiple genomic and functional genomic data types to determine a short list of genes or molecules of interest is a common task called “gene prioritization” in biology.
There are many statistical, biological, and bioinformatic methods developed to perform gene prioritization tasks.
However, little research has been conducted to examine the relationships among the technique used, merged/separate use of each data modality, the gene list’s network/pathway context, and various gene ranking/expansions.
MethodsWe introduce a new analytical framework called “Gene Ranking and Iterative Prioritization based on Pathways” (GRIPP) to prioritize genes derived from different modalities.
Multiple data sources, such as CBioPortal, PAGER, and COSMIC were used to compile the initial gene list.
We used the PAGER software to expand the gene list based on biological pathways and the BEERE software to construct protein-protein interaction networks that include the gene list to rank order genes.
We produced a final gene list for each data modality iteratively from an initial draft gene list, using glioblastoma multiform (GBM) as a case study.
ConclusionWe demonstrated that GBM gene lists obtained from three modalities (differential gene expressions, gene mutations, and copy number alterations) and several data sources could be iteratively expanded and ranked using GRIPP.
While integrating various modalities of data can be useful to generate an integrated ranked gene list related to any specific disease, the integration may also decrease the overall significance of ranked genes derived from specific data modalities.
Therefore, we recommend carefully sorting and integrating gene lists according to each modality, such as gene mutations, epigenetic controls, or differential expressions, to procure modality-specific biological insights into the prioritized genes.
Related Results
Hydatid Disease of The Brain Parenchyma: A Systematic Review
Hydatid Disease of The Brain Parenchyma: A Systematic Review
Abstarct
Introduction
Isolated brain hydatid disease (BHD) is an extremely rare form of echinococcosis. A prompt and timely diagnosis is a crucial step in disease management. This ...
Identification and Validation of Key m7G-Related Genes and Cuproptosis-Related Genes Related to Immunity in Polycystic Ovary Syndrome (PCOS) by Comprehensive Bioinformatics Analysis
Identification and Validation of Key m7G-Related Genes and Cuproptosis-Related Genes Related to Immunity in Polycystic Ovary Syndrome (PCOS) by Comprehensive Bioinformatics Analysis
Abstract
Background
Polycystic ovary syndrome (PCOS) is a complex endocrine disease that is one of the most common disease of reproductive age and is one of the important ...
Exploring the Diagnosis of Immune-Related Genes in Metabolic Syndrome Based on Three Algorithms
Exploring the Diagnosis of Immune-Related Genes in Metabolic Syndrome Based on Three Algorithms
Abstract
Background
The pathogenesis of Metabolic Syndrome (MetS) remains largely unexplored. This study aims to explore the immune-related genes in MetS.
Methods
The mic...
Identification of differentially expressed genes associated with ferroptosis in Crohn’s disease
Identification of differentially expressed genes associated with ferroptosis in Crohn’s disease
Abstract
Objective:Ferroptosis-related genes may have a critical regulatory role in the pathogenetic process of Crohn’ disease(CD).The purpose of this study was to identify...
SO004IDENTIFICATION OF HUB GENES, FOCUSED ON CHEMOKINES AND CHEMOKINE RECEPTORS, ASSOCIATED WITH THE DECLINE OF RENAL FUCTION OF DIABETIC KIDNEY DISEASE BY WEIGHTED GENE CO-EXPRESSION NETWORK ANALYSIS
SO004IDENTIFICATION OF HUB GENES, FOCUSED ON CHEMOKINES AND CHEMOKINE RECEPTORS, ASSOCIATED WITH THE DECLINE OF RENAL FUCTION OF DIABETIC KIDNEY DISEASE BY WEIGHTED GENE CO-EXPRESSION NETWORK ANALYSIS
Abstract
Background and Aims
The fact that activation of the innate immune system and chronic inflammation are closely involved ...
Identification of key genes in DN based on lipid metabolism
Identification of key genes in DN based on lipid metabolism
Abstract
Background
Diabetic nephropathy (DN), which is one of the most common systemic microvascular complications of diabetes mellitus, is extremely harmful to the patie...
Transcriptome combined with Mendelian randomization to screen key genes associated with mitochondrial and programmed cell death causally associated with diabetic retinopathy
Transcriptome combined with Mendelian randomization to screen key genes associated with mitochondrial and programmed cell death causally associated with diabetic retinopathy
BackgroundMitochondrial dysfunction in the retina can induce apoptosis of retinal capillary cells, leading to diabetic retinopathy (DR). This study aimed to explore key genes relat...
Transcriptomic Analysis of Medicago Truncatula under Long Day Conditions
Transcriptomic Analysis of Medicago Truncatula under Long Day Conditions
To explore the expression characteristics and biological functions of related genes of medicago terrestris under long day conditions, and to lay a foundation for revealing the mole...

