Javascript must be enabled to continue!
Knotted artifacts in predicted 3D RNA structures
View through CrossRef
AbstractUnlike proteins, RNAs deposited in the Protein Data Bank do not contain topological knots. Recently, admittedly, the first trefoil knot and some lasso-type conformations have been found in experimental RNA structures, but these are still exceptional cases. Meanwhile, algorithms predicting 3D RNA models have happened to form knotted structures not so rarely. Interestingly, machine learning-based predictors seem to be more prone to generate knotted RNA folds than traditional methods. A similar situation is observed for the entanglements of structural elements. In this paper, we analyze all models submitted to the CASP15 competition in the 3D RNA structure prediction category. We show what types of topological knots and structure element entanglements appear in the submitted models and highlight what methods are behind the generation of such conformations. We also study the structural aspect of susceptibility to entanglement. We suggest that predictors take care of an evaluation of RNA models to avoid publishing structures with artifacts, such as unusual entanglements, that result from hallucinations of predictive algorithms.Author summary3D RNA structure prediction contests such as CASP and RNA-Puzzles lack measures for topology-wise evaluation of predicted models. Thus, predictors happen to submit potentially inappropriate conformations, for example, containing entanglements that are prediction artifacts.Automated identification of entanglements in 3D RNA structures is computationally hard. Distinguishing correct from incorrectly entangled conformations is not trivial and often requires expert knowledge.We analyzed 3D RNA models submitted to CASP15 and found that all entanglements in these models are artifacts.Compared to non-ML, machine learning-based methods are more prone to generating entanglements that are not present in natural RNAs.To increase the reliability of 3D RNA structure prediction, it is necessary to reject abnormally entangled structures in the modeling stage.
Cold Spring Harbor Laboratory
Title: Knotted artifacts in predicted 3D RNA structures
Description:
AbstractUnlike proteins, RNAs deposited in the Protein Data Bank do not contain topological knots.
Recently, admittedly, the first trefoil knot and some lasso-type conformations have been found in experimental RNA structures, but these are still exceptional cases.
Meanwhile, algorithms predicting 3D RNA models have happened to form knotted structures not so rarely.
Interestingly, machine learning-based predictors seem to be more prone to generate knotted RNA folds than traditional methods.
A similar situation is observed for the entanglements of structural elements.
In this paper, we analyze all models submitted to the CASP15 competition in the 3D RNA structure prediction category.
We show what types of topological knots and structure element entanglements appear in the submitted models and highlight what methods are behind the generation of such conformations.
We also study the structural aspect of susceptibility to entanglement.
We suggest that predictors take care of an evaluation of RNA models to avoid publishing structures with artifacts, such as unusual entanglements, that result from hallucinations of predictive algorithms.
Author summary3D RNA structure prediction contests such as CASP and RNA-Puzzles lack measures for topology-wise evaluation of predicted models.
Thus, predictors happen to submit potentially inappropriate conformations, for example, containing entanglements that are prediction artifacts.
Automated identification of entanglements in 3D RNA structures is computationally hard.
Distinguishing correct from incorrectly entangled conformations is not trivial and often requires expert knowledge.
We analyzed 3D RNA models submitted to CASP15 and found that all entanglements in these models are artifacts.
Compared to non-ML, machine learning-based methods are more prone to generating entanglements that are not present in natural RNAs.
To increase the reliability of 3D RNA structure prediction, it is necessary to reject abnormally entangled structures in the modeling stage.
Related Results
B-247 BLADE-R: streamlined RNA extraction for clinical diagnostics and high-throughput applications
B-247 BLADE-R: streamlined RNA extraction for clinical diagnostics and high-throughput applications
Abstract
Background
Efficient nucleic acid extraction and purification are crucial for cellular and molecular biology research, ...
Accurate in silico predictions of modified RNA interactions to a prototypical RNA-binding protein with λ-dynamics
Accurate in silico predictions of modified RNA interactions to a prototypical RNA-binding protein with λ-dynamics
RNA-binding proteins shape biology through their widespread functions in RNA biochemistry. Their function requires the recognition of specific RNA motifs for targeted binding. Thes...
Detection of Multiple Types of Cancer Driver Mutations Using Targeted RNA Sequencing in NSCLC
Detection of Multiple Types of Cancer Driver Mutations Using Targeted RNA Sequencing in NSCLC
ABSTRACTCurrently, DNA and RNA are used separately to capture different types of gene mutations. DNA is commonly used for the detection of SNVs, indels and CNVs; RNA is used for an...
Accurate in silico predictions of modified RNA interactions to a prototypical RNA-binding protein with λ-dynamics
Accurate in silico predictions of modified RNA interactions to a prototypical RNA-binding protein with λ-dynamics
RNA-binding proteins shape biology through their widespread functions in RNA biochemistry. Their function requires the recognition of specific RNA motifs for targeted binding. Thes...
Molecular Drivers of RNA Phase Separation
Molecular Drivers of RNA Phase Separation
AbstractRNA molecules are essential in orchestrating the assembly of biomolecular condensates and membraneless compartments in cells. Many condensates form via the association of R...
Abstract P1-05-23: Utilities and challenges of RNA-Seq based expression and variant calling in a clinical setting
Abstract P1-05-23: Utilities and challenges of RNA-Seq based expression and variant calling in a clinical setting
Abstract
Introduction
Variant calling based on DNA samples has been the gold standard of clinical testing since the advent of Sanger sequencing. The u...
Abstract 2323: Deciphering RNA degradation: Insights from a comparative analysis of paired fresh frozen/FFPE total RNA-seq
Abstract 2323: Deciphering RNA degradation: Insights from a comparative analysis of paired fresh frozen/FFPE total RNA-seq
Abstract
Background: Fresh frozen (FF) and formalin-fixed paraffin-embedded (FFPE) samples are primary resources for archival tissues in cancer studies. Despite the ...
Conformational dynamics of RNA
Conformational dynamics of RNA
Two projects are presented in this thesis, showcasing the application of PELDOR spectroscopy, and, more specifically, orientation-selective PELDOR spectroscopy to the study of the ...

