Javascript must be enabled to continue!
Are methylation beta-values simplex distributed?
View through CrossRef
Abstract
DNA methylation plays an important role in the development and progression of disease. Beta-values are the standard methylation measures. Different statistical methods have been proposed to assess differences in methylation between conditions. However, most of them do not completely account for the distribution of beta-values. The simplex distribution can accommodate beta-values data. We hypothesize that simplex is a quite flexible distribution which is able to model methylation data.
To test our hypothesis, we conducted several analyses using four real data sets obtained from microarrays and sequencing technologies. Standard data distributions were studied and modelled in comparison to the simplex. Besides, some simulations were conducted in different scenarios encompassing several distribution assumptions, regression models and sample sizes. Finally, we compared DNA methylation between females and males in order to benchmark the assessed methodologies under different scenarios.
According to the results obtained by the simulations and real data analyses, DNA methylation data are concordant with the simplex distribution in many situations. Simplex regression models work well in small sample size data sets. However, when sample size increases, other models such as the beta regression or even the linear regression can be employed to assess group comparisons and obtain unbiased results. Based on these results, we can provide some practical recommendations when analyzing methylation data: 1) use data sets of at least 10 samples per studied condition for microarray data sets or 30 in NGS data sets, 2) apply a simplex or beta regression model for microarray data, 3) apply a linear model in any other case.
Title: Are methylation beta-values simplex distributed?
Description:
Abstract
DNA methylation plays an important role in the development and progression of disease.
Beta-values are the standard methylation measures.
Different statistical methods have been proposed to assess differences in methylation between conditions.
However, most of them do not completely account for the distribution of beta-values.
The simplex distribution can accommodate beta-values data.
We hypothesize that simplex is a quite flexible distribution which is able to model methylation data.
To test our hypothesis, we conducted several analyses using four real data sets obtained from microarrays and sequencing technologies.
Standard data distributions were studied and modelled in comparison to the simplex.
Besides, some simulations were conducted in different scenarios encompassing several distribution assumptions, regression models and sample sizes.
Finally, we compared DNA methylation between females and males in order to benchmark the assessed methodologies under different scenarios.
According to the results obtained by the simulations and real data analyses, DNA methylation data are concordant with the simplex distribution in many situations.
Simplex regression models work well in small sample size data sets.
However, when sample size increases, other models such as the beta regression or even the linear regression can be employed to assess group comparisons and obtain unbiased results.
Based on these results, we can provide some practical recommendations when analyzing methylation data: 1) use data sets of at least 10 samples per studied condition for microarray data sets or 30 in NGS data sets, 2) apply a simplex or beta regression model for microarray data, 3) apply a linear model in any other case.
Related Results
Role of T cell receptor V beta genes in Theiler's virus-induced demyelination of mice.
Role of T cell receptor V beta genes in Theiler's virus-induced demyelination of mice.
Abstract
Intracerebral infection of certain strains of mice with Theiler's virus results in chronic immune-mediated demyelination in spinal cord. We used mouse mutan...
Abstract A37: Aberrant DNA methylation of HTATIP2 and UCH-L1 as prognostic and predictive biomarkers for cholangiocarcinoma
Abstract A37: Aberrant DNA methylation of HTATIP2 and UCH-L1 as prognostic and predictive biomarkers for cholangiocarcinoma
Abstract
Cholangiocarcinoma (CCA) is a malignancy of bile duct epithelial cell lining. In the past decade, the incidence and mortality rates of CCA have been increas...
Abstract 1289: Crosstalk between methylation and alternative splicing in cancer
Abstract 1289: Crosstalk between methylation and alternative splicing in cancer
Abstract
Background: DNA methylation in promoter regions leads to transcriptional silencing, such as of tumor suppressor genes in cancer. However, the functions of D...
Methylation of the RIZ1 Gene In Myelodysplastic Syndrome and Acute Myeloid Leukemia
Methylation of the RIZ1 Gene In Myelodysplastic Syndrome and Acute Myeloid Leukemia
Abstract
Abstract 5112
Inactivation of a tumor suppressor gene is often caused by a mutation, small deletion of one allele accompanied by loss of the ...
Correcting Methylation Calls in Clinically Relevant Low-Mappability Regions
Correcting Methylation Calls in Clinically Relevant Low-Mappability Regions
AbstractDNA methylation is an important component in vital biological functions such as embryonic development, carcinogenesis, and heritable regulation. Accurate methods to assess ...
Genome-Wide DNA Methylation Analysis Identifies Aberrant Epigenetic Changes in CD8+ T Cells from Chronic Lymphocytic Leukemia Patients
Genome-Wide DNA Methylation Analysis Identifies Aberrant Epigenetic Changes in CD8+ T Cells from Chronic Lymphocytic Leukemia Patients
Abstract
Background CD8+ T cells from chronic lymphocytic leukemia (CLL) patients have been demonstrated to exhibit a number of alterations in global gene expression...
P1621m6A RNA-methylation in the progression of heart failure
P1621m6A RNA-methylation in the progression of heart failure
Abstract
Introduction
N6-Methyladenosine (m6A) is the most abundant modification of RNA and was found to be a dynamic and revers...
Comparative Promoter Methylation Analysis of p53 Target Genes in Urogenital Cancers
Comparative Promoter Methylation Analysis of p53 Target Genes in Urogenital Cancers
<i>Introduction:</i> The methylation status of selected new p53 target genes in bladder, kidney and testicular cancer was investigated to find similarities in methylati...

