Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Validating subcellular localization prediction tools with mycobacterial proteins

View through CrossRef
Abstract Background The computational prediction of mycobacterial proteins' subcellular localization is of key importance for proteome annotation and for the identification of new drug targets and vaccine candidates. Several subcellular localization classifiers have been developed over the past few years, which have comprised both general localization and feature-based classifiers. Here, we have validated the ability of different bioinformatics approaches, through the use of SignalP 2.0, TatP 1.0, LipoP 1.0, Phobius, PA-SUB 2.5, PSORTb v.2.0.4 and Gpos-PLoc, to predict secreted bacterial proteins. These computational tools were compared in terms of sensitivity, specificity and Matthew's correlation coefficient (MCC) using a set of mycobacterial proteins having less than 40% identity, none of which are included in the training data sets of the validated tools and whose subcellular localization have been experimentally confirmed. These proteins belong to the TBpred training data set, a computational tool specifically designed to predict mycobacterial proteins. Results A final validation set of 272 mycobacterial proteins was obtained from the initial set of 852 mycobacterial proteins. According to the results of the validation metrics, all tools presented specificity above 0.90, while dispersion sensitivity and MCC values were above 0.22. PA-SUB 2.5 presented the highest values; however, these results might be biased due to the methodology used by this tool. PSORTb v.2.0.4 left 56 proteins out of the classification, while Gpos-PLoc left just one protein out. Conclusion Both subcellular localization approaches had high predictive specificity and high recognition of true negatives for the tested data set. Among those tools whose predictions are not based on homology searches against SWISS-PROT, Gpos-PLoc was the general localization tool with the best predictive performance, while SignalP 2.0 was the best tool among the ones using a feature-based approach. Even though PA-SUB 2.5 presented the highest metrics, it should be taken into account that this tool was trained using all proteins reported in SWISS-PROT, which includes the protein set tested in this study, either as a BLAST search or as a training model.
Title: Validating subcellular localization prediction tools with mycobacterial proteins
Description:
Abstract Background The computational prediction of mycobacterial proteins' subcellular localization is of key importance for proteome annotation and for the identification of new drug targets and vaccine candidates.
Several subcellular localization classifiers have been developed over the past few years, which have comprised both general localization and feature-based classifiers.
Here, we have validated the ability of different bioinformatics approaches, through the use of SignalP 2.
0, TatP 1.
0, LipoP 1.
0, Phobius, PA-SUB 2.
5, PSORTb v.
2.
4 and Gpos-PLoc, to predict secreted bacterial proteins.
These computational tools were compared in terms of sensitivity, specificity and Matthew's correlation coefficient (MCC) using a set of mycobacterial proteins having less than 40% identity, none of which are included in the training data sets of the validated tools and whose subcellular localization have been experimentally confirmed.
These proteins belong to the TBpred training data set, a computational tool specifically designed to predict mycobacterial proteins.
Results A final validation set of 272 mycobacterial proteins was obtained from the initial set of 852 mycobacterial proteins.
According to the results of the validation metrics, all tools presented specificity above 0.
90, while dispersion sensitivity and MCC values were above 0.
22.
PA-SUB 2.
5 presented the highest values; however, these results might be biased due to the methodology used by this tool.
PSORTb v.
2.
4 left 56 proteins out of the classification, while Gpos-PLoc left just one protein out.
Conclusion Both subcellular localization approaches had high predictive specificity and high recognition of true negatives for the tested data set.
Among those tools whose predictions are not based on homology searches against SWISS-PROT, Gpos-PLoc was the general localization tool with the best predictive performance, while SignalP 2.
0 was the best tool among the ones using a feature-based approach.
Even though PA-SUB 2.
5 presented the highest metrics, it should be taken into account that this tool was trained using all proteins reported in SWISS-PROT, which includes the protein set tested in this study, either as a BLAST search or as a training model.

Related Results

Indoor Localization System Based on RSSI-APIT Algorithm
Indoor Localization System Based on RSSI-APIT Algorithm
An indoor localization system based on the RSSI-APIT algorithm is designed in this study. Integrated RSSI (received signal strength indication) and non-ranging APIT (approximate pe...
Prediction of Protein Subcellular Localization Based on Fusion of Multi-view Features
Prediction of Protein Subcellular Localization Based on Fusion of Multi-view Features
The prediction of protein subcellular localization is critical for inferring protein functions, gene regulations and protein-protein interactions. With the advances of high-through...
Deep generative model for protein subcellular localization prediction
Deep generative model for protein subcellular localization prediction
AbstractProtein sequence determines not only its structure but also its subcellular localization. Although a series of artificial intelligence models have been reported to predict ...
Perforin, a cytotoxic molecule which mediates cell necrosis, is not required for the early control of mycobacterial infection in mice
Perforin, a cytotoxic molecule which mediates cell necrosis, is not required for the early control of mycobacterial infection in mice
Host defense against mycobacterial infection requires the participation of monocytes and T cells. Both CD4+ and CD8+ T cells have been shown to be important in resistance to mycoba...
SCREENING AND IDENTIFICATION OF ANTI-MYCOBACTERIAL PLANTS
SCREENING AND IDENTIFICATION OF ANTI-MYCOBACTERIAL PLANTS
Tuberculosis (TB) is an infectious disease caused by Mycobacterium tuberculosis (MTb) and is one of the majorcauses of death in the world. One third of the population suffers with ...
Molecular Docking Approaches to Suggest the Anti-Mycobacterial Targets of Natural Products
Molecular Docking Approaches to Suggest the Anti-Mycobacterial Targets of Natural Products
Tuberculosis (TB) is a major global threat mostly due to the development of antibiotic resistant forms of Mycobacterium tuberculosis, the causal agent of the disease. Driven by the...
Identification of heparin‐binding proteins in bovine seminal plasma
Identification of heparin‐binding proteins in bovine seminal plasma
AbstractA group of four similar proteins, BSP‐A1, BSP‐A2, BSP‐A3, and BSP‐30‐kDa, represent the major acidic proteins found in bovine seminal plasma (BSP). These proteins are secre...

Back to Top