Validating subcellular localization prediction tools with mycobacterial proteins
Abstract
Background
The computational prediction of the subcellular localization of mycobacterial proteins is of key importance for proteome annotation and for the identification of new drug targets and vaccine candidates. Several subcellular localization classifiers have been developed over the past few years, comprising both general localization and feature-based classifiers. Here, we validated the ability of different bioinformatics approaches, represented by SignalP 2.0, TatP 1.0, LipoP 1.0, Phobius, PA-SUB 2.5, PSORTb v.2.0.4 and Gpos-PLoc, to predict secreted bacterial proteins. These tools were compared in terms of sensitivity, specificity and Matthews correlation coefficient (MCC) using a set of mycobacterial proteins sharing less than 40% sequence identity, none of which are included in the training data sets of the validated tools and whose subcellular localization has been experimentally confirmed. These proteins belong to the training data set of TBpred, a computational tool specifically designed to predict the subcellular localization of mycobacterial proteins.
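The three validation metrics named above can be computed from a binary confusion matrix (secreted versus non-secreted). The following is a minimal sketch using the standard textbook definitions; the function names and the convention of returning 0.0 when a denominator is zero are assumptions made for illustration, not details taken from the study.

```python
from math import sqrt


def confusion_counts(y_true, y_pred):
    """Count TP, FP, TN, FN for binary labels (truthy = secreted)."""
    tp = fp = tn = fn = 0
    for truth, pred in zip(y_true, y_pred):
        if truth and pred:
            tp += 1
        elif not truth and pred:
            fp += 1
        elif not truth and not pred:
            tn += 1
        else:
            fn += 1
    return tp, fp, tn, fn


def classification_metrics(tp, fp, tn, fn):
    """Return (sensitivity, specificity, MCC) from confusion counts.

    Assumption: an undefined ratio (zero denominator) is reported as 0.0.
    """
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return sensitivity, specificity, mcc


if __name__ == "__main__":
    # Toy labels for illustration only; not data from the study.
    truth = [1, 1, 1, 0, 0, 0, 0, 0]
    preds = [1, 1, 0, 0, 0, 0, 0, 1]
    print(classification_metrics(*confusion_counts(truth, preds)))
```

Sensitivity and specificity each look at one class only, whereas MCC uses all four cells of the confusion matrix, which is why it is often reported alongside them when the classes are unbalanced.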
Results
A final validation set of 272 mycobacterial proteins was obtained from the initial set of 852 mycobacterial proteins. According to the validation metrics, all tools showed specificity above 0.90, whereas sensitivity and MCC values were more dispersed, with all values above 0.22. PA-SUB 2.5 presented the highest values; however, these results might be biased by the methodology this tool uses. PSORTb v.2.0.4 left 56 proteins unclassified, while Gpos-PLoc left just one protein unclassified.
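The effect of unclassified proteins can be expressed as a coverage fraction. The sketch below assumes that the 56 and 1 unclassified proteins are counted against the 272-protein validation set, which the abstract implies but does not state explicitly; the function name `coverage` is hypothetical.

```python
def coverage(n_total, n_unclassified):
    """Fraction of proteins for which a tool returned a prediction."""
    if n_total <= 0:
        raise ValueError("n_total must be positive")
    if not 0 <= n_unclassified <= n_total:
        raise ValueError("n_unclassified must lie between 0 and n_total")
    return (n_total - n_unclassified) / n_total


if __name__ == "__main__":
    # Counts from the Results paragraph; the denominator of 272 is an assumption.
    for tool, left_out in [("PSORTb v.2.0.4", 56), ("Gpos-PLoc", 1)]:
        print(f"{tool}: coverage = {coverage(272, left_out):.3f}")
```

Under that assumption, PSORTb v.2.0.4 classified 216 of 272 proteins and Gpos-PLoc 271 of 272, so a tool's metrics should be read together with how many proteins it actually classified.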
Conclusion
Both subcellular localization approaches had high predictive specificity, that is, high recognition of true negatives, for the tested data set. Among the tools whose predictions are not based on homology searches against SWISS-PROT, Gpos-PLoc was the general localization tool with the best predictive performance, while SignalP 2.0 was the best of the tools using a feature-based approach. Even though PA-SUB 2.5 presented the highest metrics, it should be taken into account that this tool was trained using all proteins reported in SWISS-PROT, which includes the protein set tested in this study, either as a BLAST search or as a training model.
Springer Science and Business Media LLC

