Integrating mean and variance heterogeneities to identify differentially expressed genes

Abstract

Background: In functional genomics studies, tests of mean heterogeneity are widely used to identify differentially expressed genes, that is, genes whose mean expression levels differ between experimental conditions. Variance heterogeneity (i.e., the difference between condition-specific variances) of gene expression levels is typically either neglected or treated as a nuisance to be corrected for. Mean heterogeneity in the expression level of a gene reflects one aspect of how its distribution changes; variance heterogeneity induced by a change in condition may reflect another. A change in condition may alter both the mean and some higher-order characteristics of the expression-level distributions of susceptible genes.

Results: In this report, we introduce the concept of mean-variance differentially expressed (MVDE) genes, whose expression means and variances are sensitive to a change in experimental condition. We proved mathematically that existing mean heterogeneity tests and variance heterogeneity tests are independent under the null hypothesis. Based on this independence, we proposed an integrative mean-variance test (IMVT) that combines gene-wise mean heterogeneity and variance heterogeneity induced by condition change. The IMVT outperformed its competitors in comprehensive simulations under normal and Laplace settings. For moderate sample sizes, the IMVT controlled type I error rates well, as did the existing mean heterogeneity tests (i.e., the Welch t test (WT) and the moderated Welch t test (MWT)) and the procedure of separate tests on mean and variance heterogeneities (SMVT), whereas the likelihood ratio test (LRT) severely inflated type I error rates. In the presence of variance heterogeneity, the IMVT appeared noticeably more powerful than all the valid mean heterogeneity tests. Application to the gene profiles of peripheral circulating B cells provided solid evidence of informative variance heterogeneity. After adjusting for background data structure, the IMVT replicated previous discoveries and identified novel experiment-wide significant MVDE genes.

Conclusions: Our results indicate a tremendous potential gain from integrating informative variance heterogeneity after adjusting for global confounders and background data structure. The proposed informative integration test summarizes the impact of condition change on the expression distributions of susceptible genes better than the existing competitors do. Particular attention should therefore be paid to explicitly exploiting the variance heterogeneity induced by condition change in functional genomics analysis.
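The abstract states that the mean and variance heterogeneity tests are independent under the null, which is the property that makes it legitimate to combine their p-values into a single gene-wise test. The abstract does not say which combination rule the IMVT uses, so the sketch below assumes Fisher's method purely for illustration. The function name `fisher_combine` and the example p-values are hypothetical; in practice the two inputs would come from, for example, a Welch t test and a variance test run on the same gene.

```python
import math


def fisher_combine(p_mean: float, p_var: float) -> float:
    """Combine two independent p-values with Fisher's method.

    Assumption: Fisher's rule is used here only as an illustration;
    the source does not state that the IMVT combines tests this way.
    """
    for p in (p_mean, p_var):
        if not 0.0 < p <= 1.0:
            raise ValueError("p-values must lie in (0, 1]")
    # Under the null, and given independence of the two tests,
    # -2 * (ln p1 + ln p2) follows a chi-square distribution with 4 df.
    stat = -2.0 * (math.log(p_mean) + math.log(p_var))
    # The chi-square(4) survival function has the closed form
    # exp(-x / 2) * (1 + x / 2), so no statistics library is needed.
    return math.exp(-stat / 2.0) * (1.0 + stat / 2.0)


# Hypothetical p-values for a single gene (not taken from the paper).
p_mean_example = 0.04  # mean heterogeneity test
p_var_example = 0.03   # variance heterogeneity test
combined = fisher_combine(p_mean_example, p_var_example)
print(f"combined p-value: {combined:.4f}")
```

When both tests carry moderate evidence, the combined p-value is smaller than either input, which is the kind of gain the abstract attributes to integrating variance heterogeneity. The combination is only valid when the two p-values are independent under the null.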

Related Results

Transcriptomic Analysis of Medicago Truncatula under Long Day Conditions
To explore the expression characteristics and biological functions of related genes of medicago terrestris under long day conditions, and to lay a foundation for revealing the mole...
Construction of a Feature Gene and Machine Prediction Model for Inflammatory Bowel Disease Based on Multi-Chip Joint Analysis
Abstract Background Inflammatory bowel disease (IBD) is a chronic non-specific inflammatory disorder triggered by immune responses and genetic factors. Currently, there ...
XA4C: eXplainable representation learning via Autoencoders revealing Critical genes
ABSTRACT Machine Learning models have been frequently used in transcriptome analyses. Particularly, Representation Learning (RL), e.g., autoencod...
Detection and characterisation of heterogeneities in the WISDOM/ExoMars 2022 radargrams.
Introduction The principal objective of Rosalind Franklin, the ExoMars Rover, is to look for evidence of past or present life on Mars. Such evidence wou...
Identification and Validation of Mitophagy-Related Genes in Diabetic Retinopathy
Background Diabetic retinopathy is one of the common chronic complications of diabetes, characterized by retinal microvascular and neurodegenerative impairment, a...
Screening candidate genes related to psoas muscle traits in Debao and Landrace pigs based on transcriptome analysis
ABSTRACT To identify the important genes that affect the phenotypic differences between the Landrace and Debao pigs, especially the differences in metabolism and muscle growth. Diff...
Identification of key genes unique to the luminal A and basal-like breast cancer subtypes via bioinformatic analysis
Abstract Background Breast cancer subtypes are statistically associated with prognosis. The search for markers of breast tumor heterogeneity and the development of precisio...