Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Sequence-based Prediction of Metamorphic Behavior in Proteins

View through CrossRef
Abstract An increasing number of proteins have been demonstrated in recent years to adopt multiple three-dimensional folds with different functions. These metamorphic proteins are characterized by having two or more folds with significant differences in their secondary structure, where each fold is stabilized by a distinct local environment. So far, about 90 metamorphic proteins have been identified in the Protein Databank (PDB), but we and others hypothesize that a far greater number of metamorphic proteins remain undiscovered. In this work, we introduce a computational model to predict metamorphic behavior in proteins using only knowledge of the sequence. In this model, secondary structure prediction programs are used to calculate diversity indices , which are measures of uncertainty in predicted secondary structure at each position in the sequence; these are then used to assign protein sequences as likely to be metamorphic vs. monomorphic (i.e. having just one-fold). We constructed a reference dataset to train our classification method, which includes a novel compilation of 140 likely-monomorphic proteins, and a set of 201 metamorphic proteins taken from the literature. Our model is able to classify proteins as metamorphic vs. monomorphic with a Matthews correlation coefficient of ∼0.4 and true positive / true negative rates of ∼65% / 80%, suggesting that it is possible to predict metamorphic behavior in proteins using only sequence information. Statement of Significance This paper introduces the diversity index as a descriptor to distinguish metamorphic proteins, which possess multiple stable folds, from monomorphic proteins that possess only one fold. The diversity index is designed to measure uncertainty in computationally predicted secondary structure, which we hypothesize is elevated for metamorphic proteins. We tested our hypothesis by training a binary classifier using the diversity index and an annotated dataset of metamorphic and monomorphic proteins, and found an optimal Matthews correlation coefficient of 0.4, supporting the hypothesis and demonstrating for the first time that it is possible to predict metamorphic behavior in proteins using only sequence information. The sequence-based classifier has broader applicability compared to methods that rely on making comparison to experimentally measured structures.
Title: Sequence-based Prediction of Metamorphic Behavior in Proteins
Description:
Abstract An increasing number of proteins have been demonstrated in recent years to adopt multiple three-dimensional folds with different functions.
These metamorphic proteins are characterized by having two or more folds with significant differences in their secondary structure, where each fold is stabilized by a distinct local environment.
So far, about 90 metamorphic proteins have been identified in the Protein Databank (PDB), but we and others hypothesize that a far greater number of metamorphic proteins remain undiscovered.
In this work, we introduce a computational model to predict metamorphic behavior in proteins using only knowledge of the sequence.
In this model, secondary structure prediction programs are used to calculate diversity indices , which are measures of uncertainty in predicted secondary structure at each position in the sequence; these are then used to assign protein sequences as likely to be metamorphic vs.
monomorphic (i.
e.
having just one-fold).
We constructed a reference dataset to train our classification method, which includes a novel compilation of 140 likely-monomorphic proteins, and a set of 201 metamorphic proteins taken from the literature.
Our model is able to classify proteins as metamorphic vs.
monomorphic with a Matthews correlation coefficient of ∼0.
4 and true positive / true negative rates of ∼65% / 80%, suggesting that it is possible to predict metamorphic behavior in proteins using only sequence information.
Statement of Significance This paper introduces the diversity index as a descriptor to distinguish metamorphic proteins, which possess multiple stable folds, from monomorphic proteins that possess only one fold.
The diversity index is designed to measure uncertainty in computationally predicted secondary structure, which we hypothesize is elevated for metamorphic proteins.
We tested our hypothesis by training a binary classifier using the diversity index and an annotated dataset of metamorphic and monomorphic proteins, and found an optimal Matthews correlation coefficient of 0.
4, supporting the hypothesis and demonstrating for the first time that it is possible to predict metamorphic behavior in proteins using only sequence information.
The sequence-based classifier has broader applicability compared to methods that rely on making comparison to experimentally measured structures.

Related Results

Metamorphic facies: A review and some suggestions for changes
Metamorphic facies: A review and some suggestions for changes
ABSTRACT The concept of metamorphic facies has been used as a tool in the interpretation of metamorphic rocks for almost 100 years. The preferred definition is a set...
Petrology, Geochemistry and Tectonic Significance of the Metamorphic Peridotites from Longmuco–Shuanghu Ophiolitic Melange Belt, Tibet
Petrology, Geochemistry and Tectonic Significance of the Metamorphic Peridotites from Longmuco–Shuanghu Ophiolitic Melange Belt, Tibet
Abstract:Taoxinghu metamorphic peridotite is a firstly reported mantle sequence of ophiolite since Longmuco‐Shuanghu‐Lancangjiang suture zone (LSLSZ) was proposed, and it is also a...
Ultra‐high pressure metamorphic rocks in the Dabie‐Su‐Lu region, China: Their formation and exhumation
Ultra‐high pressure metamorphic rocks in the Dabie‐Su‐Lu region, China: Their formation and exhumation
Abstract Based on petrological, structural, geological and geochronological research, the authors summarize the progress of ultra‐high pressure (UHP) metamorphic rock study since 1...
Caution in linking localized metamorphic P–T paths to tectonic settings
Caution in linking localized metamorphic P–T paths to tectonic settings
Metamorphic pressure-temperature-time (P-T-t) paths are critical records of crustal thermomechanical evolution, yet the growing documentation of both clockwise (CW) and countercloc...
On the significance of short-duration regional metamorphism
On the significance of short-duration regional metamorphism
Short-duration regional metamorphism is a recently observed and poorly understood phenomenon in metamorphic geology. In this review, it is defined as metamorphism on time scales th...

Back to Top