Javascript must be enabled to continue!
Direct Prediction of Intrinsically Disordered Protein Conformational Properties From Sequence
View through CrossRef
ABSTRACT
Intrinsically disordered regions (IDRs) are ubiquitous across all domains of life and play a range of functional roles. While folded domains are generally well-described by a single 3D structure, IDRs exist in a collection of interconverting states known as an ensemble. This structural heterogeneity means IDRs are largely absent from the PDB, contributing to a lack of computational approaches to predict ensemble conformational properties from sequence. Here we combine rational sequence design, large-scale molecular simulations, and deep learning to develop ALBATROSS, a deep learning model for predicting IDR ensemble dimensions from sequence. ALBATROSS enables the instantaneous prediction of ensemble average properties at proteome-wide scale. ALBATROSS is lightweight, easy-to-use, and accessible as both a locally installable software package and a point-and-click style interface in the cloud. We first demonstrate the applicability of our predictors by examining the generalizability of sequence-ensemble relationships in IDRs. Then, we leverage the high-throughput nature of ALBATROSS to characterize emergent biophysical behavior of IDRs within and between proteomes.
Update from previous version
This preprint reports an updated version of the ALBATROSS network weights trained on simulations of over 42,000 sequences.
In addition, we provide new colab notebooks that enable proteome-wide IDR prediction and annotation in minutes.
All conclusions and observations made in versions 1 and 2 of this manuscript remain true and robust.
Title: Direct Prediction of Intrinsically Disordered Protein Conformational Properties From Sequence
Description:
ABSTRACT
Intrinsically disordered regions (IDRs) are ubiquitous across all domains of life and play a range of functional roles.
While folded domains are generally well-described by a single 3D structure, IDRs exist in a collection of interconverting states known as an ensemble.
This structural heterogeneity means IDRs are largely absent from the PDB, contributing to a lack of computational approaches to predict ensemble conformational properties from sequence.
Here we combine rational sequence design, large-scale molecular simulations, and deep learning to develop ALBATROSS, a deep learning model for predicting IDR ensemble dimensions from sequence.
ALBATROSS enables the instantaneous prediction of ensemble average properties at proteome-wide scale.
ALBATROSS is lightweight, easy-to-use, and accessible as both a locally installable software package and a point-and-click style interface in the cloud.
We first demonstrate the applicability of our predictors by examining the generalizability of sequence-ensemble relationships in IDRs.
Then, we leverage the high-throughput nature of ALBATROSS to characterize emergent biophysical behavior of IDRs within and between proteomes.
Update from previous version
This preprint reports an updated version of the ALBATROSS network weights trained on simulations of over 42,000 sequences.
In addition, we provide new colab notebooks that enable proteome-wide IDR prediction and annotation in minutes.
All conclusions and observations made in versions 1 and 2 of this manuscript remain true and robust.
Related Results
Transferable deep generative modeling of intrinsically disordered protein conformations
Transferable deep generative modeling of intrinsically disordered protein conformations
ABSTRACT
Intrinsically disordered proteins have dynamic structures through which they play key biological roles. The elucidation of their conformational ensembles i...
Comparing Population-General and Sport-Specific Correlates of Disordered Eating Amongst Elite Athletes: A Cross-Sectional Study
Comparing Population-General and Sport-Specific Correlates of Disordered Eating Amongst Elite Athletes: A Cross-Sectional Study
Abstract
Background
Despite the high prevalence of disordered eating and eating disorders amongst elite athletes, it remains unclear whether risk fa...
Disentangling folding from energetic traps in simulations of disordered proteins
Disentangling folding from energetic traps in simulations of disordered proteins
ABSTRACT
Protein conformational heterogeneity plays an essential role in a myriad of different biological processes. Extensive conformational heterogeneity is espec...
SARS-CoV-2 NSP1 C-terminal region (residues 130-180) is an intrinsically disordered region
SARS-CoV-2 NSP1 C-terminal region (residues 130-180) is an intrinsically disordered region
Abstract
Nonstructural protein 1 (NSP1) of SARS-CoV-2 plays a key role in downregulation of RIG-I pathways and interacts with 40 S ribosome. Recently, the cryo-EM s...
Endothelial Protein C Receptor
Endothelial Protein C Receptor
IntroductionThe protein C anticoagulant pathway plays a critical role in the negative regulation of the blood clotting response. The pathway is triggered by thrombin, which allows ...
Molecular dynamics studies of intrinsically disordered peptides and proteins
Molecular dynamics studies of intrinsically disordered peptides and proteins
A tremendous amount of evidence has accumulated in regards to the importance of intrinsically disordered proteins (IDPs) in the functioning of the cell and their role in human dise...
Local conformations in ordered and Intrinsically disordered proteins
Local conformations in ordered and Intrinsically disordered proteins
Protein structures are highly dynamic macromolecules. This dynamics is often analysed with a limited number of proteins. In our study, molecular dynamics (MDs) simulations were per...
Local conformations analyses in ordered and intrinsically disordered proteins
Local conformations analyses in ordered and intrinsically disordered proteins
Protein structures are highly dynamic macromolecules. This dynamics is often analysed with a limited number of proteins. In our study, molecular dynamics (MDs) simulations were per...

