Javascript must be enabled to continue!
Predicting sequence and structural specificities of RNA binding regions recognized by splicing factor SRSF1
View through CrossRef
Abstract
Background
RNA-binding proteins (RBPs) play diverse roles in eukaryotic RNA processing. Despite their pervasive functions in coding and noncoding RNA biogenesis and regulation, elucidating the sequence specificities that define protein-RNA interactions remains a major challenge. Recently, CLIP-seq (Cross-linking immunoprecipitation followed by high-throughput sequencing) has been successfully implemented to study the transcriptome-wide binding patterns of SRSF1, PTBP1, NOVA and fox2 proteins. These studies either adopted traditional methods like Multiple EM for Motif Elicitation (MEME) to discover the sequence consensus of RBP's binding sites or used Z-score statistics to search for the overrepresented nucleotides of a certain size. We argue that most of these methods are not well-suited for RNA motif identification, as they are unable to incorporate the RNA structural context of protein-RNA interactions, which may affect to binding specificity. Here, we describe a novel model-based approach--RNAMotifModeler to identify the consensus of protein-RNA binding regions by integrating sequence features and RNA secondary structures.
Results
As an example, we implemented RNAMotifModeler on SRSF1 (SF2/ASF) CLIP-seq data. The sequence-structural consensus we identified is a purine-rich octamer 'AGAAGAAG' in a highly single-stranded RNA context. The unpaired probabilities, the probabilities of not forming pairs, are significantly higher than negative controls and the flanking sequence surrounding the binding site, indicating that SRSF1 proteins tend to bind on single-stranded RNA. Further statistical evaluations revealed that the second and fifth bases of SRSF1octamer motif have much stronger sequence specificities, but weaker single-strandedness, while the third, fourth, sixth and seventh bases are far more likely to be single-stranded, but have more degenerate sequence specificities. Therefore, we hypothesize that nucleotide specificity and secondary structure play complementary roles during binding site recognition by SRSF1.
Conclusion
In this study, we presented a computational model to predict the sequence consensus and optimal RNA secondary structure for protein-RNA binding regions. The successful implementation on SRSF1 CLIP-seq data demonstrates great potential to improve our understanding on the binding specificity of RNA binding proteins.
Springer Science and Business Media LLC
Title: Predicting sequence and structural specificities of RNA binding regions recognized by splicing factor SRSF1
Description:
Abstract
Background
RNA-binding proteins (RBPs) play diverse roles in eukaryotic RNA processing.
Despite their pervasive functions in coding and noncoding RNA biogenesis and regulation, elucidating the sequence specificities that define protein-RNA interactions remains a major challenge.
Recently, CLIP-seq (Cross-linking immunoprecipitation followed by high-throughput sequencing) has been successfully implemented to study the transcriptome-wide binding patterns of SRSF1, PTBP1, NOVA and fox2 proteins.
These studies either adopted traditional methods like Multiple EM for Motif Elicitation (MEME) to discover the sequence consensus of RBP's binding sites or used Z-score statistics to search for the overrepresented nucleotides of a certain size.
We argue that most of these methods are not well-suited for RNA motif identification, as they are unable to incorporate the RNA structural context of protein-RNA interactions, which may affect to binding specificity.
Here, we describe a novel model-based approach--RNAMotifModeler to identify the consensus of protein-RNA binding regions by integrating sequence features and RNA secondary structures.
Results
As an example, we implemented RNAMotifModeler on SRSF1 (SF2/ASF) CLIP-seq data.
The sequence-structural consensus we identified is a purine-rich octamer 'AGAAGAAG' in a highly single-stranded RNA context.
The unpaired probabilities, the probabilities of not forming pairs, are significantly higher than negative controls and the flanking sequence surrounding the binding site, indicating that SRSF1 proteins tend to bind on single-stranded RNA.
Further statistical evaluations revealed that the second and fifth bases of SRSF1octamer motif have much stronger sequence specificities, but weaker single-strandedness, while the third, fourth, sixth and seventh bases are far more likely to be single-stranded, but have more degenerate sequence specificities.
Therefore, we hypothesize that nucleotide specificity and secondary structure play complementary roles during binding site recognition by SRSF1.
Conclusion
In this study, we presented a computational model to predict the sequence consensus and optimal RNA secondary structure for protein-RNA binding regions.
The successful implementation on SRSF1 CLIP-seq data demonstrates great potential to improve our understanding on the binding specificity of RNA binding proteins.
Related Results
CircSMARCA5 Inhibits Migration of Glioblastoma Multiforme Cells by Regulating a Molecular Axis Involving Splicing Factors SRSF1/SRSF3/PTB
CircSMARCA5 Inhibits Migration of Glioblastoma Multiforme Cells by Regulating a Molecular Axis Involving Splicing Factors SRSF1/SRSF3/PTB
Circular RNAs (circRNAs) have recently emerged as a new class of RNAs, highly enriched in the brain and very stable within cells, exosomes and body fluids. To analyze their involve...
Regulation of Alternative Splicing in B-Cell ALL By DYRK1A
Regulation of Alternative Splicing in B-Cell ALL By DYRK1A
DYRK1A, located in the Down syndrome critical region of chromosome 21, is a serine and threonine kinase that controls multiple cellular processes including apoptosis, cell cycle, t...
Abstract 778: Dysregulation of alternative mRNA splicing by oncogenic KRAS in lung adenocarcinoma
Abstract 778: Dysregulation of alternative mRNA splicing by oncogenic KRAS in lung adenocarcinoma
Abstract
Alternative mRNA splicing is dysregulated in many cancers including lung adenocarcinoma. These aberrant splicing events can sometimes be explained by mutati...
Detecting RNA–RNA interactome
Detecting RNA–RNA interactome
AbstractThe last decade has seen a robust increase in various types of novel RNA molecules and their complexity in gene regulation. RNA molecules play a critical role in cellular e...
CircSMARCA5 Regulates VEGFA mRNA Splicing and Angiogenesis in Glioblastoma Multiforme Through the Binding of SRSF1
CircSMARCA5 Regulates VEGFA mRNA Splicing and Angiogenesis in Glioblastoma Multiforme Through the Binding of SRSF1
Circular RNAs are a large group of RNAs whose cellular functions are still being investigated. We recently proposed that circSMARCA5 acts as sponge for the splicing factor Serine a...
Abstract 1423: Integrative analysis predicts lncRNA regulating gene alternative splicing in breast cancer
Abstract 1423: Integrative analysis predicts lncRNA regulating gene alternative splicing in breast cancer
Abstract
Background: Non-coding region occupies 98% of the whole human genome and plays a
regulatory role for protein-coding genes. About 95% of the p...
Accurate in silico predictions of modified RNA interactions to a prototypical RNA-binding protein with λ-dynamics
Accurate in silico predictions of modified RNA interactions to a prototypical RNA-binding protein with λ-dynamics
RNA-binding proteins shape biology through their widespread functions in RNA biochemistry. Their function requires the recognition of specific RNA motifs for targeted binding. Thes...
HnRNPA2B1 tunes antimycobacterial immune responses in macrophages through alternative splicing of
Irgm1
HnRNPA2B1 tunes antimycobacterial immune responses in macrophages through alternative splicing of
Irgm1
ABSTRACT
Onset and progression of active tuberculosis disease result from upsetting the delicate balance between Mtb virulence and host defenses. Because it dynamic...

