Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Predicting sequence and structural specificities of RNA binding regions recognized by splicing factor SRSF1

View through CrossRef
Abstract Background RNA-binding proteins (RBPs) play diverse roles in eukaryotic RNA processing. Despite their pervasive functions in coding and noncoding RNA biogenesis and regulation, elucidating the sequence specificities that define protein-RNA interactions remains a major challenge. Recently, CLIP-seq (Cross-linking immunoprecipitation followed by high-throughput sequencing) has been successfully implemented to study the transcriptome-wide binding patterns of SRSF1, PTBP1, NOVA and fox2 proteins. These studies either adopted traditional methods like Multiple EM for Motif Elicitation (MEME) to discover the sequence consensus of RBP's binding sites or used Z-score statistics to search for the overrepresented nucleotides of a certain size. We argue that most of these methods are not well-suited for RNA motif identification, as they are unable to incorporate the RNA structural context of protein-RNA interactions, which may affect to binding specificity. Here, we describe a novel model-based approach--RNAMotifModeler to identify the consensus of protein-RNA binding regions by integrating sequence features and RNA secondary structures. Results As an example, we implemented RNAMotifModeler on SRSF1 (SF2/ASF) CLIP-seq data. The sequence-structural consensus we identified is a purine-rich octamer 'AGAAGAAG' in a highly single-stranded RNA context. The unpaired probabilities, the probabilities of not forming pairs, are significantly higher than negative controls and the flanking sequence surrounding the binding site, indicating that SRSF1 proteins tend to bind on single-stranded RNA. Further statistical evaluations revealed that the second and fifth bases of SRSF1octamer motif have much stronger sequence specificities, but weaker single-strandedness, while the third, fourth, sixth and seventh bases are far more likely to be single-stranded, but have more degenerate sequence specificities. Therefore, we hypothesize that nucleotide specificity and secondary structure play complementary roles during binding site recognition by SRSF1. Conclusion In this study, we presented a computational model to predict the sequence consensus and optimal RNA secondary structure for protein-RNA binding regions. The successful implementation on SRSF1 CLIP-seq data demonstrates great potential to improve our understanding on the binding specificity of RNA binding proteins.
Title: Predicting sequence and structural specificities of RNA binding regions recognized by splicing factor SRSF1
Description:
Abstract Background RNA-binding proteins (RBPs) play diverse roles in eukaryotic RNA processing.
Despite their pervasive functions in coding and noncoding RNA biogenesis and regulation, elucidating the sequence specificities that define protein-RNA interactions remains a major challenge.
Recently, CLIP-seq (Cross-linking immunoprecipitation followed by high-throughput sequencing) has been successfully implemented to study the transcriptome-wide binding patterns of SRSF1, PTBP1, NOVA and fox2 proteins.
These studies either adopted traditional methods like Multiple EM for Motif Elicitation (MEME) to discover the sequence consensus of RBP's binding sites or used Z-score statistics to search for the overrepresented nucleotides of a certain size.
We argue that most of these methods are not well-suited for RNA motif identification, as they are unable to incorporate the RNA structural context of protein-RNA interactions, which may affect to binding specificity.
Here, we describe a novel model-based approach--RNAMotifModeler to identify the consensus of protein-RNA binding regions by integrating sequence features and RNA secondary structures.
Results As an example, we implemented RNAMotifModeler on SRSF1 (SF2/ASF) CLIP-seq data.
The sequence-structural consensus we identified is a purine-rich octamer 'AGAAGAAG' in a highly single-stranded RNA context.
The unpaired probabilities, the probabilities of not forming pairs, are significantly higher than negative controls and the flanking sequence surrounding the binding site, indicating that SRSF1 proteins tend to bind on single-stranded RNA.
Further statistical evaluations revealed that the second and fifth bases of SRSF1octamer motif have much stronger sequence specificities, but weaker single-strandedness, while the third, fourth, sixth and seventh bases are far more likely to be single-stranded, but have more degenerate sequence specificities.
Therefore, we hypothesize that nucleotide specificity and secondary structure play complementary roles during binding site recognition by SRSF1.
Conclusion In this study, we presented a computational model to predict the sequence consensus and optimal RNA secondary structure for protein-RNA binding regions.
The successful implementation on SRSF1 CLIP-seq data demonstrates great potential to improve our understanding on the binding specificity of RNA binding proteins.

Related Results

Accurate in silico predictions of modified RNA interactions to a prototypical RNA-binding protein with λ-dynamics
Accurate in silico predictions of modified RNA interactions to a prototypical RNA-binding protein with λ-dynamics
RNA-binding proteins shape biology through their widespread functions in RNA biochemistry. Their function requires the recognition of specific RNA motifs for targeted binding. Thes...
Accurate in silico predictions of modified RNA interactions to a prototypical RNA-binding protein with λ-dynamics
Accurate in silico predictions of modified RNA interactions to a prototypical RNA-binding protein with λ-dynamics
RNA-binding proteins shape biology through their widespread functions in RNA biochemistry. Their function requires the recognition of specific RNA motifs for targeted binding. Thes...
Nuclear Encoded RNA Splicing Factors in Plant Mitochondria
Nuclear Encoded RNA Splicing Factors in Plant Mitochondria
Mitochondria are the site of respiration and numerous other metabolic processes required for plant growth and development. Increased demands for metabolic energy are observed durin...
Alcohol use disorder causes global changes in splicing in the human brain
Alcohol use disorder causes global changes in splicing in the human brain
AbstractAlcohol use disorder (AUD) is a widespread disease leading to the deterioration of cognitive and other functions. Mechanisms by which alcohol affects the brain are not full...
B-247 BLADE-R: streamlined RNA extraction for clinical diagnostics and high-throughput applications
B-247 BLADE-R: streamlined RNA extraction for clinical diagnostics and high-throughput applications
Abstract Background Efficient nucleic acid extraction and purification are crucial for cellular and molecular biology research, ...
Alternative Splicing of Serum Response Factor Reveals Isoform-Specific Remodeling in Cardiac Diseases
Alternative Splicing of Serum Response Factor Reveals Isoform-Specific Remodeling in Cardiac Diseases
Background: Alternative splicing is an important mechanism of transcriptomic and proteomic diversity and is progressively involved in cardiovascular disease (CVD) pathogenesis. Ser...
CD44 alternative splicing is a sensor of intragenic DNA methylation in tumors
CD44 alternative splicing is a sensor of intragenic DNA methylation in tumors
ABSTRACT DNA methylation (meDNA) is a suspected modulator of alternative splicing, while splicing in turn is involved in tumour formations nearly as frequently as D...

Back to Top