Javascript must be enabled to continue!
CGAN-Cmap: protein contact map prediction using deep generative adversarial neural networks
View through CrossRef
Abstract
Protein contact maps represent spatial pairwise inter-residue interactions, providing a protein’s translationally and rotationally invariant topological representation. Accurate contact map prediction has been a critical driving force for improving protein structure prediction, one of computational biology’s most challenging problems in the last half-century. While many computational tools have been developed to this end, most fail to predict accurate contact maps for proteins with insufficient homologous protein sequences, and exhibit low accuracy for long-range contacts. To address these limitations, we develop a novel hybrid model, CGAN-Cmap, that uses a generative adversarial neural network embedded with a series of modified squeeze and excitation residual networks. To exploit features of different dimensions, we build the generator of CGAN-Cmap via two parallel modules: sequential and pairwise modules to capture and interpret distance profiles from 1D sequential and 2D pairwise feature maps, respectively, and combine them during the training process to generate the contact map. This novel architecture helps to improve the contact map prediction by surpassing redundant features and encouraging more meaningful ones from 1D and 2D inputs simultaneously. We also introduce a new custom dynamic binary cross-entropy (BCE) as the loss function to extract essential details from feature maps, and thereby address the input imbalance problem for highly sparse long-range contacts in proteins with insufficient numbers of homologous sequences. We evaluate the performance of CGAN-Cmap on the 11th, 12th, 13th, and 14th Critical Assessment of protein Structure Prediction (CASP 11, 12, 13, and 14) and CAMEO test sets. CGAN-Cmap significantly outperforms state-of-the-art models, and in particular, it improves the precision of medium and long-range contact by at least 3.5%. Furthermore, our model has a low dependency on the number of homologous sequences obtained via multiple sequence alignment, suggesting that it can predict protein contact maps with good accuracy for those proteins that lack homologous templates. These results demonstrate an efficient approach for fast and highly accurate contact map prediction toward construction of protein 3D structure from protein sequence.
Data availability
All datasets and source codes are provided in:
https://github.com/mahan-fcb/CGAN-Cmap-A-protein-contact-map-predictor
Title: CGAN-Cmap: protein contact map prediction using deep generative adversarial neural networks
Description:
Abstract
Protein contact maps represent spatial pairwise inter-residue interactions, providing a protein’s translationally and rotationally invariant topological representation.
Accurate contact map prediction has been a critical driving force for improving protein structure prediction, one of computational biology’s most challenging problems in the last half-century.
While many computational tools have been developed to this end, most fail to predict accurate contact maps for proteins with insufficient homologous protein sequences, and exhibit low accuracy for long-range contacts.
To address these limitations, we develop a novel hybrid model, CGAN-Cmap, that uses a generative adversarial neural network embedded with a series of modified squeeze and excitation residual networks.
To exploit features of different dimensions, we build the generator of CGAN-Cmap via two parallel modules: sequential and pairwise modules to capture and interpret distance profiles from 1D sequential and 2D pairwise feature maps, respectively, and combine them during the training process to generate the contact map.
This novel architecture helps to improve the contact map prediction by surpassing redundant features and encouraging more meaningful ones from 1D and 2D inputs simultaneously.
We also introduce a new custom dynamic binary cross-entropy (BCE) as the loss function to extract essential details from feature maps, and thereby address the input imbalance problem for highly sparse long-range contacts in proteins with insufficient numbers of homologous sequences.
We evaluate the performance of CGAN-Cmap on the 11th, 12th, 13th, and 14th Critical Assessment of protein Structure Prediction (CASP 11, 12, 13, and 14) and CAMEO test sets.
CGAN-Cmap significantly outperforms state-of-the-art models, and in particular, it improves the precision of medium and long-range contact by at least 3.
5%.
Furthermore, our model has a low dependency on the number of homologous sequences obtained via multiple sequence alignment, suggesting that it can predict protein contact maps with good accuracy for those proteins that lack homologous templates.
These results demonstrate an efficient approach for fast and highly accurate contact map prediction toward construction of protein 3D structure from protein sequence.
Data availability
All datasets and source codes are provided in:
https://github.
com/mahan-fcb/CGAN-Cmap-A-protein-contact-map-predictor.
Related Results
Individualized causal mediation analysis with continuous treatment using conditional generative adversarial networks
Individualized causal mediation analysis with continuous treatment using conditional generative adversarial networks
AbstractTraditional methods used in causal mediation analysis with continuous treatment often focus on estimating average causal effects, limiting their applicability in precision ...
Similarity indicator and CG-CGAN prediction model for remaining useful life of rolling bearings
Similarity indicator and CG-CGAN prediction model for remaining useful life of rolling bearings
Abstract
To tackle the challenges of performing early fault warning and improving the prediction accuracy for the remaining useful life (RUL) of rolling bearings, th...
ProDef-MDS: A Proactive Defense Mechanism Protecting Malware Detection Systems from Adversarial Attacks
ProDef-MDS: A Proactive Defense Mechanism Protecting Malware Detection Systems from Adversarial Attacks
Malware threatens cybersecurity by enabling data theft, unauthorized access, and extortion. Traditional malware detection systems (MDS) struggle with the increasing volume and comp...
Predicting Students’ Academic Performance with Conditional Generative Adversarial Network and Deep SVM
Predicting Students’ Academic Performance with Conditional Generative Adversarial Network and Deep SVM
The availability of educational data obtained by technology-assisted learning platforms can potentially be used to mine student behavior in order to address their problems and enha...
Efficient Defense Against First Order Adversarial Attacks on Convolutional Neural Networks
Efficient Defense Against First Order Adversarial Attacks on Convolutional Neural Networks
Machine learning models, especially neural networks, are vulnerable to adversarial attacks, where inputs are purposefully altered to induce incorrect predictions. These adversarial...
Protein contact distance and structure prediction driven by deep learning
Protein contact distance and structure prediction driven by deep learning
Proteins, fundamental building blocks of living organisms, play a crucial role in various biological processes. Understanding protein structure is essential for unraveling their fu...
Enhancing Adversarial Robustness through Stable Adversarial Training
Enhancing Adversarial Robustness through Stable Adversarial Training
Deep neural network models are vulnerable to attacks from adversarial methods, such as gradient attacks. Evening small perturbations can cause significant differences in their pred...
Endothelial Protein C Receptor
Endothelial Protein C Receptor
IntroductionThe protein C anticoagulant pathway plays a critical role in the negative regulation of the blood clotting response. The pathway is triggered by thrombin, which allows ...

