Javascript must be enabled to continue!
metapredict: a fast, accurate, and easy-to-use predictor of consensus disorder and structure
View through CrossRef
Abstract
Intrinsically disordered proteins and protein regions make up a substantial fraction of many proteomes where they play a wide variety of essential roles. A critical first step in understanding the role of disordered protein regions in biological function is to identify those disordered regions correctly. Computational methods for disorder prediction have emerged as a core set of tools to guide experiments, interpret results, and develop hypotheses. Given the multiple different predictors available, consensus scores have emerged as a popular approach to mitigate biases or limitations of any single method. Consensus scores integrate the outcome of multiple independent disorder predictors and provide a per-residue value that reflects the number of tools that predict a residue to be disordered. Although consensus scores help mitigate the inherent problems of using any single disorder predictor, they are computationally expensive to generate. They also necessitate the installation of multiple different software tools, which can be prohibitively difficult. To address this challenge, we developed a deep-learning-based predictor of consensus disorder scores. Our predictor, metapredict, utilizes a bidirectional recurrent neural network trained on the consensus disorder scores from 12 proteomes. By benchmarking metapredict using two orthogonal approaches, we found that metapredict is among the most accurate disorder predictors currently available. Metapredict is also remarkably fast, enabling proteome-scale disorder prediction in minutes. Importantly, metapredict is fully open source and is distributed as a Python package, a collection of command-line tools, and a web server, maximizing the potential practical utility of the predictor. We believe metapredict offers a convenient, accessible, accurate, and high-performance predictor for single-proteins and proteomes alike.
Statement of Significance
Intrinsically disordered regions are found across all kingdoms of life where they play a variety of essential roles. Being able to accurately and quickly identify disordered regions in proteins using just the amino acid sequence is critical for the appropriate design and interpretation of experiments. Despite this, performing large-scale disorder prediction on thousands of sequences is challenging using extant disorder predictors due to various difficulties including general installation and computational requirements. We have developed an accurate, high-performance and easy-to-use predictor of protein disorder and structure. Our predictor, metapredict, was designed for both proteome-scale analysis and individual sequence predictions alike. Metapredict is implemented as a collection of local tools and an online web server, and is appropriate for both seasoned computational biologists and novices alike.
Title: metapredict: a fast, accurate, and easy-to-use predictor of consensus disorder and structure
Description:
Abstract
Intrinsically disordered proteins and protein regions make up a substantial fraction of many proteomes where they play a wide variety of essential roles.
A critical first step in understanding the role of disordered protein regions in biological function is to identify those disordered regions correctly.
Computational methods for disorder prediction have emerged as a core set of tools to guide experiments, interpret results, and develop hypotheses.
Given the multiple different predictors available, consensus scores have emerged as a popular approach to mitigate biases or limitations of any single method.
Consensus scores integrate the outcome of multiple independent disorder predictors and provide a per-residue value that reflects the number of tools that predict a residue to be disordered.
Although consensus scores help mitigate the inherent problems of using any single disorder predictor, they are computationally expensive to generate.
They also necessitate the installation of multiple different software tools, which can be prohibitively difficult.
To address this challenge, we developed a deep-learning-based predictor of consensus disorder scores.
Our predictor, metapredict, utilizes a bidirectional recurrent neural network trained on the consensus disorder scores from 12 proteomes.
By benchmarking metapredict using two orthogonal approaches, we found that metapredict is among the most accurate disorder predictors currently available.
Metapredict is also remarkably fast, enabling proteome-scale disorder prediction in minutes.
Importantly, metapredict is fully open source and is distributed as a Python package, a collection of command-line tools, and a web server, maximizing the potential practical utility of the predictor.
We believe metapredict offers a convenient, accessible, accurate, and high-performance predictor for single-proteins and proteomes alike.
Statement of Significance
Intrinsically disordered regions are found across all kingdoms of life where they play a variety of essential roles.
Being able to accurately and quickly identify disordered regions in proteins using just the amino acid sequence is critical for the appropriate design and interpretation of experiments.
Despite this, performing large-scale disorder prediction on thousands of sequences is challenging using extant disorder predictors due to various difficulties including general installation and computational requirements.
We have developed an accurate, high-performance and easy-to-use predictor of protein disorder and structure.
Our predictor, metapredict, was designed for both proteome-scale analysis and individual sequence predictions alike.
Metapredict is implemented as a collection of local tools and an online web server, and is appropriate for both seasoned computational biologists and novices alike.
Related Results
The Role of Personality and Stress in the Development of Substance Use Disorder among Substance Use Disorder Patients in Kaduna State, Nigeria
The Role of Personality and Stress in the Development of Substance Use Disorder among Substance Use Disorder Patients in Kaduna State, Nigeria
This study investigated the role of personality and stress in the development of substance use disorder among substance use disorder patients in Kaduna State. A cross sectional des...
Suicidal Ideation and Attempt among Alcohol use Disorder Patients Attending Addiction Clinic at Amanuel Mental Specialized Hospital, Addis Ababa, Ethiopia
Suicidal Ideation and Attempt among Alcohol use Disorder Patients Attending Addiction Clinic at Amanuel Mental Specialized Hospital, Addis Ababa, Ethiopia
Background: Alcohol is one of commonly used psychoactive substances. Harmful use of alcohol results in a cluster of behavioral and physical symptoms known as Alcohol use disorder. ...
CohortSync: Scalable Micro-Cohort-Based Protocol for Consensus and Reconciliation in Distributed Systems
CohortSync: Scalable Micro-Cohort-Based Protocol for Consensus and Reconciliation in Distributed Systems
In modern distributed systems, achieving consensus and reconciliation among diverse nodes across varying network conditions is a significant challenge. CohortSync, a novel micro-co...
Comparative Analysis of Consensus Algorithms in Blockchain
Comparative Analysis of Consensus Algorithms in Blockchain
A blockchain is a decentralized, shared, and public digital ledger that is used to log transactions across many devices so that the record cannot be altered, deleted, or destroyed ...
Bleeding Disorder Referrals to Hematology Clinic: A Single Institution Experience
Bleeding Disorder Referrals to Hematology Clinic: A Single Institution Experience
BACKGROUND
Our tertiary care pediatric hematology/oncology/BMT service receives hundreds of referrals yearly for bleeding disorder evaluation both due to bleeding sy...
RIFRAF: a frame-resolving consensus algorithm
RIFRAF: a frame-resolving consensus algorithm
AbstractMotivationProtein coding genes can be studied using long-read next generation sequencing. However, high rates of indel sequencing errors are problematic, corrupting the rea...
Investigating the Impact of Consensus Algorithm on Scalability in Blockchain Systems
Investigating the Impact of Consensus Algorithm on Scalability in Blockchain Systems
In the current era, blockchain has emerged as one the best and promising technology. All the cryptocurrencies have also gained a lot of popularity around the globe which are based ...
Adjustment Disorder and Its Clinical Management
Adjustment Disorder and Its Clinical Management
Adjustment disorder is a condition of subjective emotional distress triggered as a consequence of a meaningful change in life. The diagnosis of adjustment disorder is hindered by t...

