Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Predicting protein inter-residue contacts using composite likelihood maximization and deep learning

View through CrossRef
AbstractBackgroundAccurate prediction of inter-residue contacts of a protein is important to calculating its tertiary structure. Analysis of co-evolutionary events among residues has been proved effective in inferring inter-residue contacts. The Markov random field (MRF) technique, although being widely used for contact prediction, suffers from the following dilemma: the actual likelihood function of MRF is accurate but time-consuming to calculate; in contrast, approximations to the actual likelihood, say pseudo-likelihood, are efficient to calculate but inaccurate. Thus, how to achieve both accuracy and efficiency simultaneously remains a challenge.ResultsIn this study, we present such an approach (called clmDCA) for contact prediction. Unlike plmDCA using pseudo-likelihood, i.e., the product of conditional probability of individual residues, our approach uses composite-likelihood, i.e., the product of conditional probability of all residue pairs. Composite likelihood has been theoretically proved as a better approximation to the actual likelihood function than pseudo-likelihood. Meanwhile, composite likelihood is still efficient to maximize, thus ensuring the efficiency of clmDCA. We present comprehensive experiments on popular benchmark datasets, including PSICOV dataset and CASP-11 dataset, to show that:i) clmDCA alone outperforms the existing MRF-based approaches in prediction accuracy.ii) When equipped with deep learning technique for refinement, the prediction accuracy of clmDCA was further significantly improved, suggesting the suitability of clmDCA for subsequent refinement procedure. We further present a successful application of the predicted contacts to accurately build tertiary structures for proteins in the PSICOV dataset.ConclusionsComposite likelihood maximization algorithm can efficiently estimate the parameters of Markov Random Fields and can improve the prediction accuracy of protein inter-residue contacts.
Title: Predicting protein inter-residue contacts using composite likelihood maximization and deep learning
Description:
AbstractBackgroundAccurate prediction of inter-residue contacts of a protein is important to calculating its tertiary structure.
Analysis of co-evolutionary events among residues has been proved effective in inferring inter-residue contacts.
The Markov random field (MRF) technique, although being widely used for contact prediction, suffers from the following dilemma: the actual likelihood function of MRF is accurate but time-consuming to calculate; in contrast, approximations to the actual likelihood, say pseudo-likelihood, are efficient to calculate but inaccurate.
Thus, how to achieve both accuracy and efficiency simultaneously remains a challenge.
ResultsIn this study, we present such an approach (called clmDCA) for contact prediction.
Unlike plmDCA using pseudo-likelihood, i.
e.
, the product of conditional probability of individual residues, our approach uses composite-likelihood, i.
e.
, the product of conditional probability of all residue pairs.
Composite likelihood has been theoretically proved as a better approximation to the actual likelihood function than pseudo-likelihood.
Meanwhile, composite likelihood is still efficient to maximize, thus ensuring the efficiency of clmDCA.
We present comprehensive experiments on popular benchmark datasets, including PSICOV dataset and CASP-11 dataset, to show that:i) clmDCA alone outperforms the existing MRF-based approaches in prediction accuracy.
ii) When equipped with deep learning technique for refinement, the prediction accuracy of clmDCA was further significantly improved, suggesting the suitability of clmDCA for subsequent refinement procedure.
We further present a successful application of the predicted contacts to accurately build tertiary structures for proteins in the PSICOV dataset.
ConclusionsComposite likelihood maximization algorithm can efficiently estimate the parameters of Markov Random Fields and can improve the prediction accuracy of protein inter-residue contacts.

Related Results

ISSEC: inferring contacts among protein secondary structure elements using deep object detection
ISSEC: inferring contacts among protein secondary structure elements using deep object detection
Abstract Background The formation of contacts among protein secondary structure elements (SSEs) is an important step in protein foldi...
Biscuit Residue in the Nutrition of Laying Hens: Effects on Animal Health, Performance and Egg Quality
Biscuit Residue in the Nutrition of Laying Hens: Effects on Animal Health, Performance and Egg Quality
Background: Corn and soybean meal are common ingredients used in poultry feed in order to supply the demand for energy and protein, respectively. Also, these ingredients directly i...
Frequency of Common Chromosomal Abnormalities in Patients with Idiopathic Acquired Aplastic Anemia
Frequency of Common Chromosomal Abnormalities in Patients with Idiopathic Acquired Aplastic Anemia
Objective: To determine the frequency of common chromosomal aberrations in local population idiopathic determine the frequency of common chromosomal aberrations in local population...
Tuberculosis yield among contacts of non-pulmonary bacteriologically confirmed index TB patients in the urban setting of central Uganda
Tuberculosis yield among contacts of non-pulmonary bacteriologically confirmed index TB patients in the urban setting of central Uganda
Background The World Health Organization (WHO) recommends systematic and active investigation of TB contacts. However, lower priority is given to contact investigation among other ...
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
The pandemic Covid-19 currently demands teachers to be able to use technology in teaching and learning process. But in reality there are still many teachers who have not been able ...
Endothelial Protein C Receptor
Endothelial Protein C Receptor
IntroductionThe protein C anticoagulant pathway plays a critical role in the negative regulation of the blood clotting response. The pathway is triggered by thrombin, which allows ...
Correlated mutations distinguish misfolded and properly folded proteins
Correlated mutations distinguish misfolded and properly folded proteins
Knowledge about the three dimensional structure of proteins is crucial in order to learn about their behavior, stability, or role as a target in drug design. Unfortunately, traditi...

Back to Top