Javascript must be enabled to continue!
Are methylation beta-values simplex distributed?
View through CrossRef
Abstract
DNA methylation plays an important role in the development and progression of disease. Beta-values are the standard methylation measures. Different statistical methods have been proposed to assess differences in methylation between conditions. However, most of them do not completely account for the distribution of beta-values. The simplex distribution can accommodate beta-values data. We hypothesize that simplex is a quite flexible distribution which is able to model methylation data.
To test our hypothesis, we conducted several analyses using four real data sets obtained from microarrays and sequencing technologies. Standard data distributions were studied and modelled in comparison to the simplex. Besides, some simulations were conducted in different scenarios encompassing several distribution assumptions, regression models and sample sizes. Finally, we compared DNA methylation between females and males in order to benchmark the assessed methodologies under different scenarios.
According to the results obtained by the simulations and real data analyses, DNA methylation data are concordant with the simplex distribution in many situations. Simplex regression models work well in small sample size data sets. However, when sample size increases, other models such as the beta regression or even the linear regression can be employed to assess group comparisons and obtain unbiased results. Based on these results, we can provide some practical recommendations when analyzing methylation data: 1) use data sets of at least 10 samples per studied condition for microarray data sets or 30 in NGS data sets, 2) apply a simplex or beta regression model for microarray data, 3) apply a linear model in any other case.
Title: Are methylation beta-values simplex distributed?
Description:
Abstract
DNA methylation plays an important role in the development and progression of disease.
Beta-values are the standard methylation measures.
Different statistical methods have been proposed to assess differences in methylation between conditions.
However, most of them do not completely account for the distribution of beta-values.
The simplex distribution can accommodate beta-values data.
We hypothesize that simplex is a quite flexible distribution which is able to model methylation data.
To test our hypothesis, we conducted several analyses using four real data sets obtained from microarrays and sequencing technologies.
Standard data distributions were studied and modelled in comparison to the simplex.
Besides, some simulations were conducted in different scenarios encompassing several distribution assumptions, regression models and sample sizes.
Finally, we compared DNA methylation between females and males in order to benchmark the assessed methodologies under different scenarios.
According to the results obtained by the simulations and real data analyses, DNA methylation data are concordant with the simplex distribution in many situations.
Simplex regression models work well in small sample size data sets.
However, when sample size increases, other models such as the beta regression or even the linear regression can be employed to assess group comparisons and obtain unbiased results.
Based on these results, we can provide some practical recommendations when analyzing methylation data: 1) use data sets of at least 10 samples per studied condition for microarray data sets or 30 in NGS data sets, 2) apply a simplex or beta regression model for microarray data, 3) apply a linear model in any other case.
Related Results
Role of T cell receptor V beta genes in Theiler's virus-induced demyelination of mice.
Role of T cell receptor V beta genes in Theiler's virus-induced demyelination of mice.
Abstract
Intracerebral infection of certain strains of mice with Theiler's virus results in chronic immune-mediated demyelination in spinal cord. We used mouse mutan...
Correcting Methylation Calls in Clinically Relevant Low-Mappability Regions
Correcting Methylation Calls in Clinically Relevant Low-Mappability Regions
AbstractDNA methylation is an important component in vital biological functions such as embryonic development, carcinogenesis, and heritable regulation. Accurate methods to assess ...
Comparative Promoter Methylation Analysis of p53 Target Genes in Urogenital Cancers
Comparative Promoter Methylation Analysis of p53 Target Genes in Urogenital Cancers
<i>Introduction:</i> The methylation status of selected new p53 target genes in bladder, kidney and testicular cancer was investigated to find similarities in methylati...
Whole-genome bisulfite sequencing of multiple individuals reveals complementary roles of promoter and gene body methylation in transcriptional regulation
Whole-genome bisulfite sequencing of multiple individuals reveals complementary roles of promoter and gene body methylation in transcriptional regulation
Abstract
Background
DNA methylation is an important type of epigenetic modification involved in gene regulation. Although strong DNA...
Comprehensive IsomiR sequencing profile of human pancreatic islets and EndoC-βH1 beta-cells
Comprehensive IsomiR sequencing profile of human pancreatic islets and EndoC-βH1 beta-cells
AbstractAims/HypothesisMiRNAs play a crucial role in regulating the islet transcriptome, influencing beta cell functions and pathways. Emerging evidence suggests that during biogen...
Bioinformatics Unravels the Epigenetic Mechanisms of Hashimoto’s Thyroiditis: Deciphering Molecular Complexity
Bioinformatics Unravels the Epigenetic Mechanisms of Hashimoto’s Thyroiditis: Deciphering Molecular Complexity
ABSTRACT
Introduction
Recent research in the field of epigenetics has shed light on the impact of epigenetic modifications in t...
Abstract 1425: Prognostic significance of promoter DNA methylation in patients with neuroblastoma
Abstract 1425: Prognostic significance of promoter DNA methylation in patients with neuroblastoma
Abstract
Background: Neuroblastoma is a genetically heterogenic tumor diagnosed in childhood which exhibits broad clinical diversity ranging from rapid disease progr...
Persistent infection of hepatitis B virus is involved in high rate of p16 methylation in hepatocellular carcinoma
Persistent infection of hepatitis B virus is involved in high rate of p16 methylation in hepatocellular carcinoma
AbstractHigh rate of chronic hepatitis B virus (HBV) infection and p16 promoter methylation were found in the majority of hepatocellular carcinoma (HCC). To investigate the potenti...

