Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Ensemble Machine Learning to “Boost” Ubiquitination-sites Prediction

View through CrossRef
ABSTRACT Ubiquitination-site prediction is an important task because ubiquitination is a critical regulatory function for many biological processes such as proteasome degradation, DNA repair and transcription, signal transduction, endocytoses, and sorting. However, the highly dynamic and reversible nature of ubiquitination makes it difficult to experimentally identify specific ubiquitination sites. In this paper, we explore the possibility of improving the prediction of ubiquitination sites using ensemble machine learning methods including Random Forrest (RF), Adaptive Boosting (ADB), Gradient Boosting (GB), and eXtreme Gradient Boosting (XGB). By doing grid search with the four ensemble methods and six comparison non-ensemble learning methods including Naïve Base (NB), Logistic Regression (LR), Decision Trees (DT), Support Vector Machine (SVM), LASSO, and K-Nearest Neighbor (KNN), we find that all the four ensemble methods significantly outperform one or more non-ensemble methods included in this study. XGB outperforms three out of the six non-ensemble methods that we included; ADB and RF both outperform two of the six non-ensemble methods; GB outperforms one non-ensemble method. Comparing the four ensemble methods among themselves. GB performs the worst; XGB and ADB are very comparable in terms of prediction, but ADB beats XGB by far in terms of both the unit model training time and total running time. Both XGB and ADB tend to do better than RF in terms of prediction, but RF has the shortest unit model training time out of the three. In addition, we notice that ADB tends to outperform XGB when dealing with small-scale datasets, and RF can outperform either ADB or XGB when data are less balanced. Interestingly, we find that SVM, LR, and LASSO, three of the six non-ensemble methods included, perform comparably with all the ensemble methods. Based on this study, ensemble learning is a promising approach to ignificantly improving ubiquitination-site prediction using protein segment data.
Title: Ensemble Machine Learning to “Boost” Ubiquitination-sites Prediction
Description:
ABSTRACT Ubiquitination-site prediction is an important task because ubiquitination is a critical regulatory function for many biological processes such as proteasome degradation, DNA repair and transcription, signal transduction, endocytoses, and sorting.
However, the highly dynamic and reversible nature of ubiquitination makes it difficult to experimentally identify specific ubiquitination sites.
In this paper, we explore the possibility of improving the prediction of ubiquitination sites using ensemble machine learning methods including Random Forrest (RF), Adaptive Boosting (ADB), Gradient Boosting (GB), and eXtreme Gradient Boosting (XGB).
By doing grid search with the four ensemble methods and six comparison non-ensemble learning methods including Naïve Base (NB), Logistic Regression (LR), Decision Trees (DT), Support Vector Machine (SVM), LASSO, and K-Nearest Neighbor (KNN), we find that all the four ensemble methods significantly outperform one or more non-ensemble methods included in this study.
XGB outperforms three out of the six non-ensemble methods that we included; ADB and RF both outperform two of the six non-ensemble methods; GB outperforms one non-ensemble method.
Comparing the four ensemble methods among themselves.
GB performs the worst; XGB and ADB are very comparable in terms of prediction, but ADB beats XGB by far in terms of both the unit model training time and total running time.
Both XGB and ADB tend to do better than RF in terms of prediction, but RF has the shortest unit model training time out of the three.
In addition, we notice that ADB tends to outperform XGB when dealing with small-scale datasets, and RF can outperform either ADB or XGB when data are less balanced.
Interestingly, we find that SVM, LR, and LASSO, three of the six non-ensemble methods included, perform comparably with all the ensemble methods.
Based on this study, ensemble learning is a promising approach to ignificantly improving ubiquitination-site prediction using protein segment data.

Related Results

Computational methods for ubiquitination site prediction using physicochemical properties of protein sequences
Computational methods for ubiquitination site prediction using physicochemical properties of protein sequences
Abstract Background Ubiquitination is a very important process in protein post-translational modification, which has been widely investigated by bio...
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
Selection of Injectable Drug Product Composition using Machine Learning Models (Preprint)
BACKGROUND As of July 2020, a Web of Science search of “machine learning (ML)” nested within the search of “pharmacokinetics or pharmacodynamics” yielded over 100...
UBI-XGB: IDENTIFICATION OF UBIQUITIN PROTEINS USING MACHINE LEARNING MODEL
UBI-XGB: IDENTIFICATION OF UBIQUITIN PROTEINS USING MACHINE LEARNING MODEL
A recent line of research has focused on Ubiquitination, a pervasive and proteasome-mediated protein degradation that controls apoptosis and is crucial in the breakdown of proteins...
[RETRACTED] Rhino XL Male Enhancement v1
[RETRACTED] Rhino XL Male Enhancement v1
[RETRACTED]Rhino XL Reviews, NY USA: Studies show that testosterone levels in males decrease constantly with growing age. There are also many other problems that males face due ...
Improve Ubiquitination-site Prediction Using Physicochemical Properties and Machine Learning with Grid Search
Improve Ubiquitination-site Prediction Using Physicochemical Properties and Machine Learning with Grid Search
ABSTRACT Background Ubiquitination plays an important role in protein post-translational processes and has been found to be involved in a number of regulatory functions including p...
Bendless-mediated K63 ubiquitination modulates cellular signalling to regulate Drosophila hematopoiesis
Bendless-mediated K63 ubiquitination modulates cellular signalling to regulate Drosophila hematopoiesis
Abstract Ubiquitination is a reversible modification whose traditional role has been associated with K48-linked poly-ubiquitination involved in proteasomal degradation....
Cbl-mediated K63-linked ubiquitination of JAK2 enhances JAK2 phosphorylation and signal transduction
Cbl-mediated K63-linked ubiquitination of JAK2 enhances JAK2 phosphorylation and signal transduction
AbstractJAK2 activation is crucial for cytokine receptor signal transduction and leukemogenesis. However, the underlying processes that lead to full activation of JAK2 are unclear....

Back to Top