Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Are methylation beta-values simplex distributed?

View through CrossRef
Abstract DNA methylation plays an important role in the development and progression of disease. Beta-values are the standard methylation measures. Different statistical methods have been proposed to assess differences in methylation between conditions. However, most of them do not completely account for the distribution of beta-values. The simplex distribution can accommodate beta-values data. We hypothesize that simplex is a quite flexible distribution which is able to model methylation data. To test our hypothesis, we conducted several analyses using four real data sets obtained from microarrays and sequencing technologies. Standard data distributions were studied and modelled in comparison to the simplex. Besides, some simulations were conducted in different scenarios encompassing several distribution assumptions, regression models and sample sizes. Finally, we compared DNA methylation between females and males in order to benchmark the assessed methodologies under different scenarios. According to the results obtained by the simulations and real data analyses, DNA methylation data are concordant with the simplex distribution in many situations. Simplex regression models work well in small sample size data sets. However, when sample size increases, other models such as the beta regression or even the linear regression can be employed to assess group comparisons and obtain unbiased results. Based on these results, we can provide some practical recommendations when analyzing methylation data: 1) use data sets of at least 10 samples per studied condition for microarray data sets or 30 in NGS data sets, 2) apply a simplex or beta regression model for microarray data, 3) apply a linear model in any other case.
Title: Are methylation beta-values simplex distributed?
Description:
Abstract DNA methylation plays an important role in the development and progression of disease.
Beta-values are the standard methylation measures.
Different statistical methods have been proposed to assess differences in methylation between conditions.
However, most of them do not completely account for the distribution of beta-values.
The simplex distribution can accommodate beta-values data.
We hypothesize that simplex is a quite flexible distribution which is able to model methylation data.
To test our hypothesis, we conducted several analyses using four real data sets obtained from microarrays and sequencing technologies.
Standard data distributions were studied and modelled in comparison to the simplex.
Besides, some simulations were conducted in different scenarios encompassing several distribution assumptions, regression models and sample sizes.
Finally, we compared DNA methylation between females and males in order to benchmark the assessed methodologies under different scenarios.
According to the results obtained by the simulations and real data analyses, DNA methylation data are concordant with the simplex distribution in many situations.
Simplex regression models work well in small sample size data sets.
However, when sample size increases, other models such as the beta regression or even the linear regression can be employed to assess group comparisons and obtain unbiased results.
Based on these results, we can provide some practical recommendations when analyzing methylation data: 1) use data sets of at least 10 samples per studied condition for microarray data sets or 30 in NGS data sets, 2) apply a simplex or beta regression model for microarray data, 3) apply a linear model in any other case.

Related Results

Role of T cell receptor V beta genes in Theiler's virus-induced demyelination of mice.
Role of T cell receptor V beta genes in Theiler's virus-induced demyelination of mice.
Abstract Intracerebral infection of certain strains of mice with Theiler's virus results in chronic immune-mediated demyelination in spinal cord. We used mouse mutan...
Correcting Methylation Calls in Clinically Relevant Low-Mappability Regions
Correcting Methylation Calls in Clinically Relevant Low-Mappability Regions
AbstractDNA methylation is an important component in vital biological functions such as embryonic development, carcinogenesis, and heritable regulation. Accurate methods to assess ...
Comparative Promoter Methylation Analysis of p53 Target Genes in Urogenital Cancers
Comparative Promoter Methylation Analysis of p53 Target Genes in Urogenital Cancers
<i>Introduction:</i> The methylation status of selected new p53 target genes in bladder, kidney and testicular cancer was investigated to find similarities in methylati...
Comprehensive IsomiR sequencing profile of human pancreatic islets and EndoC-βH1 beta-cells
Comprehensive IsomiR sequencing profile of human pancreatic islets and EndoC-βH1 beta-cells
AbstractAims/HypothesisMiRNAs play a crucial role in regulating the islet transcriptome, influencing beta cell functions and pathways. Emerging evidence suggests that during biogen...
Abstract 1425: Prognostic significance of promoter DNA methylation in patients with neuroblastoma
Abstract 1425: Prognostic significance of promoter DNA methylation in patients with neuroblastoma
Abstract Background: Neuroblastoma is a genetically heterogenic tumor diagnosed in childhood which exhibits broad clinical diversity ranging from rapid disease progr...
Persistent infection of hepatitis B virus is involved in high rate of p16 methylation in hepatocellular carcinoma
Persistent infection of hepatitis B virus is involved in high rate of p16 methylation in hepatocellular carcinoma
AbstractHigh rate of chronic hepatitis B virus (HBV) infection and p16 promoter methylation were found in the majority of hepatocellular carcinoma (HCC). To investigate the potenti...

Back to Top