Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Protein Fold Classification using Graph Neural Network and Protein Topology Graph

View through CrossRef
AbstractProtein fold classification reveals key structural information about proteins that is essential for understanding their function. While numerous approaches exist in the literature that classifies protein fold from sequence data using machine learning, there is hardly any approach that classifies protein fold from the secondary or tertiary structure data using deep learning. This work proposes a novel protein fold classification technique based on graph neural network and protein topology graphs. Protein topology graphs are constructed according to definitions in the Protein Topology Graph Library from protein secondary structure level data and their contacts. To the best of our knowledge, this is the first approach that applies graph neural network for protein fold classification. We analyze the SCOPe 2.07 data set, a manually and computationally curated database that classifies known protein structures into their taxonomic hierarchy and provides predefined labels for a certain number of entries from the Protein Data Bank. We also analyze the latest version of the CATH data set. Experimental results show that the classification accuracy is at around 82% − 100% under certain settings. Due to the rapid growth of structural data, automating the structure classification process with high accuracy using structural data is much needed in the field. This work introduces a new paradigm of protein fold classification that meets this need. The implementation of the model for protein fold classification and the datasets are available here https://github.com/SuriDipannitaSayeed/ProteinFoldClassification.gitAuthor summaryClassification of protein structures is traditionally done using manual curation, evolutionary relationship, or sequence comparison-based methods. Applying machine learning and deep learning to protein structure classification is a comparatively new trend that holds great promises for automating the structure classification process. Advance deep learning technique like Graph Neural Network is still unexplored in this respect. SCOP and CATH are two traditional databases that provide the hierarchical taxonomic classification of protein structures. This work provides a novel computational approach that classifies protein folds in SCOP and CATH with graph neural network, performing a graph classification task.
Title: Protein Fold Classification using Graph Neural Network and Protein Topology Graph
Description:
AbstractProtein fold classification reveals key structural information about proteins that is essential for understanding their function.
While numerous approaches exist in the literature that classifies protein fold from sequence data using machine learning, there is hardly any approach that classifies protein fold from the secondary or tertiary structure data using deep learning.
This work proposes a novel protein fold classification technique based on graph neural network and protein topology graphs.
Protein topology graphs are constructed according to definitions in the Protein Topology Graph Library from protein secondary structure level data and their contacts.
To the best of our knowledge, this is the first approach that applies graph neural network for protein fold classification.
We analyze the SCOPe 2.
07 data set, a manually and computationally curated database that classifies known protein structures into their taxonomic hierarchy and provides predefined labels for a certain number of entries from the Protein Data Bank.
We also analyze the latest version of the CATH data set.
Experimental results show that the classification accuracy is at around 82% − 100% under certain settings.
Due to the rapid growth of structural data, automating the structure classification process with high accuracy using structural data is much needed in the field.
This work introduces a new paradigm of protein fold classification that meets this need.
The implementation of the model for protein fold classification and the datasets are available here https://github.
com/SuriDipannitaSayeed/ProteinFoldClassification.
gitAuthor summaryClassification of protein structures is traditionally done using manual curation, evolutionary relationship, or sequence comparison-based methods.
Applying machine learning and deep learning to protein structure classification is a comparatively new trend that holds great promises for automating the structure classification process.
Advance deep learning technique like Graph Neural Network is still unexplored in this respect.
SCOP and CATH are two traditional databases that provide the hierarchical taxonomic classification of protein structures.
This work provides a novel computational approach that classifies protein folds in SCOP and CATH with graph neural network, performing a graph classification task.

Related Results

Graph convolutional neural networks for 3D data analysis
Graph convolutional neural networks for 3D data analysis
(English) Deep Learning allows the extraction of complex features directly from raw input data, eliminating the need for hand-crafted features from the classical Machine Learning p...
Optimising tool wear and workpiece condition monitoring via cyber-physical systems for smart manufacturing
Optimising tool wear and workpiece condition monitoring via cyber-physical systems for smart manufacturing
Smart manufacturing has been developed since the introduction of Industry 4.0. It consists of resource sharing and networking, predictive engineering, and material and data analyti...
Role of Organic Agriculture in Enhancing Soil Health: Implications for Physico-Chemical and Biological Properties
Role of Organic Agriculture in Enhancing Soil Health: Implications for Physico-Chemical and Biological Properties
Soil health is fundamental to sustainable agriculture and food security. Organic agricultural practices have gained increasing recognition for its capacity to improving soil health...
Artificial Neural Network Topology Optimization using K-Fold Cross Validation for Spray Drying of Coconut Milk
Artificial Neural Network Topology Optimization using K-Fold Cross Validation for Spray Drying of Coconut Milk
Abstract In this study, the development of an optimized topology neural network model for spray drying coconut milk is investigated using K-fold cross validation tec...
Domination of Polynomial with Application
Domination of Polynomial with Application
In this paper, .We .initiate the study of domination. polynomial , consider G=(V,E) be a simple, finite, and directed graph without. isolated. vertex .We present a study of the Ira...
Modified neural networks for rapid recovery of tokamak plasma parameters for real time control
Modified neural networks for rapid recovery of tokamak plasma parameters for real time control
Two modified neural network techniques are used for the identification of the equilibrium plasma parameters of the Superconducting Steady State Tokamak I from external magnetic mea...
CommunityGCN: community detection using node classification with graph convolution network
CommunityGCN: community detection using node classification with graph convolution network
PurposeA community demonstrates the unique qualities and relationships between its members that distinguish it from other communities within a network. Network analysis relies heav...

Back to Top