Predicting protein inter-residue contacts using composite likelihood maximization and deep learning

08/31/2018
by   Haicang Zhang, et al.

Accurate prediction of the inter-residue contacts of a protein is important for calculating its tertiary structure. Analysis of co-evolutionary events among residues has proved effective for inferring inter-residue contacts. The Markov random field (MRF) technique, although widely used for contact prediction, suffers from the following dilemma: the actual likelihood function of an MRF is accurate but time-consuming to calculate, whereas approximations to the actual likelihood, such as pseudo-likelihood, are efficient to calculate but inaccurate. Thus, how to achieve both accuracy and efficiency simultaneously remains a challenge. In this study, we present an approach (called clmDCA) that addresses this challenge for contact prediction. Unlike plmDCA, which uses pseudo-likelihood, i.e., the product of the conditional probabilities of individual residues, our approach uses composite likelihood, i.e., the product of the conditional probabilities of all residue pairs. Composite likelihood has been theoretically proved to be a better approximation to the actual likelihood function than pseudo-likelihood. Meanwhile, composite likelihood is still efficient to maximize, thus ensuring the efficiency of clmDCA. We present comprehensive experiments on popular benchmark datasets, including the PSICOV dataset and the CASP-11 dataset, to show that: i) clmDCA alone outperforms the existing MRF-based approaches in prediction accuracy; ii) when equipped with a deep learning technique for refinement, the prediction accuracy of clmDCA was further significantly improved, suggesting the suitability of clmDCA for subsequent refinement procedures. We further present a successful application of the predicted contacts to accurately build tertiary structures for proteins in the PSICOV dataset. Accessibility: The software clmDCA and a server are publicly accessible through http://protein.ict.ac.cn/clmDCA/.
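
To make the distinction between the three objectives concrete, the following is a minimal sketch on a toy Potts-style MRF, not the clmDCA implementation. The parameter names `h` (per-position terms) and `J` (pairwise terms), the helper `log_conditional`, and all numeric values are assumptions chosen for illustration. Each objective is evaluated by brute force for a single sequence: the exact log-likelihood normalizes over every possible sequence, the pseudo-likelihood sums conditionals of single positions, and the composite likelihood sums conditionals of position pairs.

```python
import itertools
import math


def logsumexp(values):
    """Numerically stable log(sum(exp(v))) for a list of floats."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def score(x, h, J):
    """Unnormalized log-probability of sequence x under a toy pairwise MRF.

    h[i][a] is a per-position term and J[(i, j)][a][b] (i < j) a pairwise
    term; both names are illustrative assumptions.
    """
    n = len(x)
    s = sum(h[i][x[i]] for i in range(n))
    s += sum(J[(i, j)][x[i]][x[j]]
             for i, j in itertools.combinations(range(n), 2))
    return s


def log_conditional(x, h, J, sites, q):
    """log P(x[sites] | all other positions): only `sites` are summed over."""
    alternatives = []
    for values in itertools.product(range(q), repeat=len(sites)):
        y = list(x)
        for site, value in zip(sites, values):
            y[site] = value
        alternatives.append(score(y, h, J))
    return score(x, h, J) - logsumexp(alternatives)


def log_likelihood(x, h, J, q):
    """Exact log-likelihood: normalizes over all q**n sequences."""
    return log_conditional(x, h, J, tuple(range(len(x))), q)


def log_pseudo_likelihood(x, h, J, q):
    """Sum of log P(x_i | rest) over single positions."""
    return sum(log_conditional(x, h, J, (i,), q) for i in range(len(x)))


def log_composite_likelihood(x, h, J, q):
    """Sum of log P(x_i, x_j | rest) over all position pairs i < j."""
    pairs = itertools.combinations(range(len(x)), 2)
    return sum(log_conditional(x, h, J, pair, q) for pair in pairs)


# Toy example: 3 positions, 2 states, arbitrary made-up parameters.
q = 2
h = [[0.1, -0.2], [0.0, 0.3], [-0.1, 0.2]]
J = {
    (0, 1): [[0.5, -0.5], [-0.5, 0.5]],
    (0, 2): [[0.0, 0.0], [0.0, 0.0]],
    (1, 2): [[-0.3, 0.3], [0.3, -0.3]],
}
x = [0, 1, 1]
print("exact    :", log_likelihood(x, h, J, q))
print("pseudo   :", log_pseudo_likelihood(x, h, J, q))
print("composite:", log_composite_likelihood(x, h, J, q))
```

For a sequence of n positions with q states each, the exact normalization in this sketch sums q^n terms, the pseudo-likelihood sums q terms for each of n positions, and the pairwise composite likelihood sums q^2 terms for each of n(n-1)/2 pairs. This is the sense in which the two approximations stay tractable as n grows while the exact likelihood does not.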

research
09/02/2016

Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model

Recently exciting progress has been made on protein contact prediction, ...
research
04/24/2017

Predicting membrane protein contacts from non-membrane proteins by deep transfer learning

Computational prediction of membrane protein (MP) structures is very cha...
research
12/10/2013

Protein Contact Prediction by Integrating Joint Evolutionary Coupling Analysis and Supervised Learning

Protein contacts contain important information for protein structure and...
research
05/26/2022

DRLComplex: Reconstruction of protein quaternary structures using deep reinforcement learning

Predicted inter-chain residue-residue contacts can be used to build the ...
research
08/08/2013

Predicting protein contact map using evolutionary and physical constraints by integer programming (extended version)

Motivation. Protein contact map describes the pairwise spatial and funct...
research
06/08/2023

Heterogeneity-aware integrative analyses for ancestry-specific association studies

Ancestry-specific proteome-wide association studies (PWAS) based on gene...
