Improve Protein Solubility and Activity based on Machine Learning Models
Abstract
Improving the catalytic ability of protein biocatalysts reduces the production cost of biocatalytic manufacturing processes, but the search space of possible proteins and mutants is too large to explore exhaustively through experiments. To some extent, highly soluble recombinant proteins tend to exhibit high activity. Here, we demonstrate that an optimization methodology based on a machine learning prediction model can effectively and quantitatively predict which peptide tags improve protein solubility. Using protein sequence information, a support vector machine model we recently developed was used to evaluate protein solubility after randomly mutated tags were added to a target protein. The optimization algorithm guided the tags to evolve towards variants that result in higher solubility. The optimization results were validated by adding the tags designed by our algorithm to a model protein, expressing it in vivo, and experimentally quantifying its solubility and activity. For example, the solubility of a tyrosine ammonia lyase was more than doubled by adding two tags to its N- and C-termini. Its activity was also increased nearly 3.5-fold by adding the tags. Additional experiments also supported that the designed tags were effective for improving the activity of multiple proteins and performed better than previously reported tags. The presented optimization methodology thus provides a valuable tool for understanding the correlation between amino acid sequence and protein solubility and for engineering protein biocatalysts.
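The tag-evolution loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the scoring function `toy_solubility_score` is a hypothetical stand-in (fraction of charged residues) for the support vector machine model, and the names `mutate` and `evolve_tags`, the tag length, and the accept-if-not-worse rule are all assumptions made for illustration only.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def toy_solubility_score(sequence: str) -> float:
    """Hypothetical stand-in for the paper's SVM solubility predictor.

    Returns the fraction of charged residues (D, E, K, R). This is an
    assumption for illustration, not the model used in the study.
    """
    if not sequence:
        return 0.0
    return sum(sequence.count(aa) for aa in "DEKR") / len(sequence)


def mutate(tag: str, rng: random.Random) -> str:
    """Replace one randomly chosen position of the tag with a random residue."""
    pos = rng.randrange(len(tag))
    return tag[:pos] + rng.choice(AMINO_ACIDS) + tag[pos + 1:]


def evolve_tags(protein, tag_length=10, generations=200, seed=0,
                score=toy_solubility_score):
    """Evolve an N-terminal and a C-terminal tag towards a higher predicted score.

    Each generation mutates one of the two tags at random and keeps the
    mutant only if the predicted score of tag + protein + tag does not drop.
    """
    rng = random.Random(seed)
    n_tag = "".join(rng.choice(AMINO_ACIDS) for _ in range(tag_length))
    c_tag = "".join(rng.choice(AMINO_ACIDS) for _ in range(tag_length))
    best = score(n_tag + protein + c_tag)
    history = [best]
    for _ in range(generations):
        cand_n, cand_c = n_tag, c_tag
        if rng.random() < 0.5:
            cand_n = mutate(n_tag, rng)
        else:
            cand_c = mutate(c_tag, rng)
        candidate_score = score(cand_n + protein + cand_c)
        if candidate_score >= best:  # accept neutral or improving mutations
            n_tag, c_tag, best = cand_n, cand_c, candidate_score
        history.append(best)
    return n_tag, c_tag, history


if __name__ == "__main__":
    # "MKTAYIAKQR" is an arbitrary example sequence, not a protein from the paper.
    n_tag, c_tag, history = evolve_tags("MKTAYIAKQR", tag_length=8, generations=100)
    print(n_tag, c_tag, round(history[0], 3), round(history[-1], 3))
```

In practice the `score` argument would be replaced by a trained predictor that maps a full sequence to a solubility estimate; the loop itself only needs a function that returns a number for each candidate sequence.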
Contact
kang.zhou@nus.edu.sg, chewxia@nus.edu.sg

