Javascript must be enabled to continue!
Learning structural motif representations for efficient protein structure search
View through CrossRef
Abstract
Motivation
Understanding the relationship between protein structure and function is a fundamental problem in protein science. Given a protein of unknown function, fast identification of similar protein structures from the Protein Data Bank (PDB) is a critical step for inferring its biological function. Such structural neighbors can provide evolutionary insights into protein conformation, interfaces and binding sites that are not detectable from sequence similarity. However, the computational cost of performing pairwise structural alignment against all structures in PDB is prohibitively expensive. Alignment-free approaches have been introduced to enable fast but coarse comparisons by representing each protein as a vector of structure features or fingerprints and only computing similarity between vectors. As a notable example, FragBag represents each protein by a “bag of fragments”, which is a vector of frequencies of contiguous short backbone fragments from a predetermined library.
Results
Here we present a new approach to learning effective structural motif presentations using deep learning. We develop DeepFold, a deep convolutional neural network model to extract structural motif features of a protein structure. Similar to FragBag, DeepFold represents each protein structure or fold using a vector of learned structural motif features. We demonstrate that DeepFold substantially outperforms FragBag on protein structural search on a non-redundant protein structure database and a set of newly released structures. Remarkably, DeepFold not only extracts meaningful backbone segments but also finds important long-range interacting motifs for structural comparison. We expect that DeepFold will provide new insights into the evolution and hierarchical organization of protein structural motifs.
Availability
https://github.com/largelymfs/DeepFold
Contact
jianpeng@illinois.edu
Title: Learning structural motif representations for efficient protein structure search
Description:
Abstract
Motivation
Understanding the relationship between protein structure and function is a fundamental problem in protein science.
Given a protein of unknown function, fast identification of similar protein structures from the Protein Data Bank (PDB) is a critical step for inferring its biological function.
Such structural neighbors can provide evolutionary insights into protein conformation, interfaces and binding sites that are not detectable from sequence similarity.
However, the computational cost of performing pairwise structural alignment against all structures in PDB is prohibitively expensive.
Alignment-free approaches have been introduced to enable fast but coarse comparisons by representing each protein as a vector of structure features or fingerprints and only computing similarity between vectors.
As a notable example, FragBag represents each protein by a “bag of fragments”, which is a vector of frequencies of contiguous short backbone fragments from a predetermined library.
Results
Here we present a new approach to learning effective structural motif presentations using deep learning.
We develop DeepFold, a deep convolutional neural network model to extract structural motif features of a protein structure.
Similar to FragBag, DeepFold represents each protein structure or fold using a vector of learned structural motif features.
We demonstrate that DeepFold substantially outperforms FragBag on protein structural search on a non-redundant protein structure database and a set of newly released structures.
Remarkably, DeepFold not only extracts meaningful backbone segments but also finds important long-range interacting motifs for structural comparison.
We expect that DeepFold will provide new insights into the evolution and hierarchical organization of protein structural motifs.
Availability
https://github.
com/largelymfs/DeepFold
Contact
jianpeng@illinois.
edu.
Related Results
Bentuk Dan Fungsi Batee Ranup Bagi Masyarakat Aceh
Bentuk Dan Fungsi Batee Ranup Bagi Masyarakat Aceh
ABSTRACT Batee ranup has a variety of shapes and motifs, such as round or round oval shapes that have legs and there are also square shapes in general. Batee ranup has five kinds o...
Form Follows Force: A theoretical framework for Structural Morphology, and Form-Finding research on shell structures
Form Follows Force: A theoretical framework for Structural Morphology, and Form-Finding research on shell structures
The springing up of freeform architecture and structures introduces many challenges to structural engineers. The main challenge is to generate structural forms with high structural...
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
The pandemic Covid-19 currently demands teachers to be able to use technology in teaching and learning process. But in reality there are still many teachers who have not been able ...
Endothelial Protein C Receptor
Endothelial Protein C Receptor
IntroductionThe protein C anticoagulant pathway plays a critical role in the negative regulation of the blood clotting response. The pathway is triggered by thrombin, which allows ...
Residues Neighboring an SH3-Binding Motif Participate in the Interaction
In Vivo
Residues Neighboring an SH3-Binding Motif Participate in the Interaction
In Vivo
Abstract
In signaling networks, protein-protein interactions are often mediated by modular domains that bind short linear motifs. The motifs’ seq...
KAJIAN MOTIF BATIK PRING SEDAPUR KARYA NUNUNG WIJAYANTI DI GROBOGAN MENGGUNAKAN KONSEP PENCIPTAAN KRIYA
KAJIAN MOTIF BATIK PRING SEDAPUR KARYA NUNUNG WIJAYANTI DI GROBOGAN MENGGUNAKAN KONSEP PENCIPTAAN KRIYA
ABSTRAK Batik merupakan salah satu perwujudan dari kebudayaan Indonesia yang dituangkan dalam selembar kain. Batik Grobogan merupakan salah satu ikon yang menggambarkan karakterist...
Evaluating the Science to Inform the Physical Activity Guidelines for Americans Midcourse Report
Evaluating the Science to Inform the Physical Activity Guidelines for Americans Midcourse Report
Abstract
The Physical Activity Guidelines for Americans (Guidelines) advises older adults to be as active as possible. Yet, despite the well documented benefits of physical a...
Development of Malay Deli Songket Motifs Based on Symmetry Groups
Development of Malay Deli Songket Motifs Based on Symmetry Groups
One of the tribes in North Sumatra Province that has a wide variety of art is the Deli Malays, especially the Songket motifs. Songket is a type of traditional Indonesian weaving th...

