Javascript must be enabled to continue!
DNA-sequence and epigenomic determinants of local rates of transcription elongation
View through CrossRef
Abstract
Across all branches of life, transcription elongation is a crucial, regulated phase in gene expression. Many recent studies in eukaryotes have focused on the regulation of promoter-proximal pausing of RNA Polymerase II (Pol II), but rates of productive elongation also vary substantially throughout the gene body, both within and across genes. Here, we introduce a probabilistic model for systematically evaluating potential determinants of the local elongation rate based on nascent RNA sequencing (NRS) data. Our model is derived from a unified model for both the kinetics of Pol II movement along the DNA template and the generation of NRS read counts at steady state. It allows for a continuously variable elongation rate along the gene body, with the rate at each nucleotide defined by a generalized linear relationship with nearby genomic and epigenomic features. High-dimensional feature vectors are accommodated through a sparse-regression extension. We show with simulations that the model allows accurate detection of associated features and accurate prediction of local elongation rates. In an analysis of public PRO-seq and epigenomic data, we identify several features that are strongly associated with reductions in the local elongation rate, including DNA methylation, splice sites, RNA stem-loops, CTCF binding sites, and several histone marks, including H3K36me3 and H4K20me1. By contrast, low-complexity sequences and H3K79me2 marks are associated with increases in elongation rate. In an analysis of DNA
k
-mers, we find that cytosine nucleotides are strongly associated with reductions in local elongation rate, particularly when preceded by guanines and followed by adenines or thymines. Increases in elongation rate are associated with thymines and A+T-rich
k
-mers. These associations are generally shared across cell types, and by considering them our model is effective at predicting features of held-out PRO-seq data. Overall, our analysis is the first to permit genome-wide predictions of relative nucleotide-specific elongation rates based on complex sets of genomic and epigenomic covariates. We have made predictions available for the K562, CD14+, MCF-7, and HeLa-S3 cell types in a UCSC Genome Browser track.
Title: DNA-sequence and epigenomic determinants of local rates of transcription elongation
Description:
Abstract
Across all branches of life, transcription elongation is a crucial, regulated phase in gene expression.
Many recent studies in eukaryotes have focused on the regulation of promoter-proximal pausing of RNA Polymerase II (Pol II), but rates of productive elongation also vary substantially throughout the gene body, both within and across genes.
Here, we introduce a probabilistic model for systematically evaluating potential determinants of the local elongation rate based on nascent RNA sequencing (NRS) data.
Our model is derived from a unified model for both the kinetics of Pol II movement along the DNA template and the generation of NRS read counts at steady state.
It allows for a continuously variable elongation rate along the gene body, with the rate at each nucleotide defined by a generalized linear relationship with nearby genomic and epigenomic features.
High-dimensional feature vectors are accommodated through a sparse-regression extension.
We show with simulations that the model allows accurate detection of associated features and accurate prediction of local elongation rates.
In an analysis of public PRO-seq and epigenomic data, we identify several features that are strongly associated with reductions in the local elongation rate, including DNA methylation, splice sites, RNA stem-loops, CTCF binding sites, and several histone marks, including H3K36me3 and H4K20me1.
By contrast, low-complexity sequences and H3K79me2 marks are associated with increases in elongation rate.
In an analysis of DNA
k
-mers, we find that cytosine nucleotides are strongly associated with reductions in local elongation rate, particularly when preceded by guanines and followed by adenines or thymines.
Increases in elongation rate are associated with thymines and A+T-rich
k
-mers.
These associations are generally shared across cell types, and by considering them our model is effective at predicting features of held-out PRO-seq data.
Overall, our analysis is the first to permit genome-wide predictions of relative nucleotide-specific elongation rates based on complex sets of genomic and epigenomic covariates.
We have made predictions available for the K562, CD14+, MCF-7, and HeLa-S3 cell types in a UCSC Genome Browser track.
Related Results
Cocaine-induced DNA-PK relieves RNAP II pausing by promoting TRIM28 phosphorylation
Cocaine-induced DNA-PK relieves RNAP II pausing by promoting TRIM28 phosphorylation
AbstractDrug abuse continues to pose a significant challenge in HIV control efforts. In our investigation, we discovered that cocaine not only upregulates the expression of DNA-dep...
Genome wide hypomethylation and youth-associated DNA gap reduction promoting DNA damage and senescence-associated pathogenesis
Genome wide hypomethylation and youth-associated DNA gap reduction promoting DNA damage and senescence-associated pathogenesis
Abstract
Background: Age-associated epigenetic alteration is the underlying cause of DNA damage in aging cells. Two types of youth-associated DNA-protection epigenetic mark...
Genome wide hypomethylation and youth-associated DNA gap reduction promoting DNA damage and senescence-associated pathogenesis
Genome wide hypomethylation and youth-associated DNA gap reduction promoting DNA damage and senescence-associated pathogenesis
Introduction: The United States currently faces two opioid crises, an evolved crisis currently manifesting as widespread abuse of illicit opioids, and a crisis in pain management l...
Echinococcus granulosus in Environmental Samples: A Cross-Sectional Molecular Study
Echinococcus granulosus in Environmental Samples: A Cross-Sectional Molecular Study
Abstract
Introduction
Echinococcosis, caused by tapeworms of the Echinococcus genus, remains a significant zoonotic disease globally. The disease is particularly prevalent in areas...
Theory of transcription bursting: Stochasticity in the transcription rates
Theory of transcription bursting: Stochasticity in the transcription rates
ABSTRACT
Transcription bursting creates variation among the individuals of a given population. Bursting emerges as the consequence of turning on ...
Frequency of Common Chromosomal Abnormalities in Patients with Idiopathic Acquired Aplastic Anemia
Frequency of Common Chromosomal Abnormalities in Patients with Idiopathic Acquired Aplastic Anemia
Objective: To determine the frequency of common chromosomal aberrations in local population idiopathic determine the frequency of common chromosomal aberrations in local population...
A network of epigenomic and transcriptional cooperation encompassing an epigenomic master regulator in cancer
A network of epigenomic and transcriptional cooperation encompassing an epigenomic master regulator in cancer
Abstract
Coordinated experiments focused on transcriptional responses and chromatin states are well-equipped to capture different epigenomic and transcriptomic leve...
Abstract 2105: Cell-free DNA fragments inform epigenomic mechanisms for early detection of breast cancer
Abstract 2105: Cell-free DNA fragments inform epigenomic mechanisms for early detection of breast cancer
Abstract
Introduction: Chromatin accessibility and cell-free DNA fragmentation patterns can be used to identify epigenomic mechanisms (Sharma et al. 2010) and infer ...

