Protein Contact Prediction by Integrating Joint Evolutionary Coupling Analysis and Supervised Learning

12/10/2013
by   Jianzhu Ma, et al.

Protein contacts contain important information for the study of protein structure and function, but contact prediction from sequence remains very challenging. Both evolutionary coupling (EC) analysis and supervised machine learning methods have been developed to predict contacts, each making use of a different type of information. This paper presents a group graphical lasso (GGL) method for contact prediction that integrates joint multi-family EC analysis and supervised learning. Unlike existing single-family EC analysis, which uses residue co-evolution information from only the target protein family, our joint EC analysis uses residue co-evolution in both the target family and its related families, which may have divergent sequences but similar folds. To implement joint EC analysis, we model a set of related protein families using Gaussian graphical models (GGMs) and then co-estimate their precision matrices by maximum likelihood, subject to the constraint that the precision matrices share similar residue co-evolution patterns. To further improve the accuracy of the estimated precision matrices, we employ a supervised learning method to predict contact probability from a variety of evolutionary and non-evolutionary information and then incorporate the predicted probability as a prior into our GGL framework. Experiments show that our method predicts contacts much more accurately than existing methods and that it performs better on both conserved and family-specific contacts.
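To make the co-estimation idea concrete, the following is a minimal sketch of a group-lasso-penalized Gaussian likelihood for several related families. It is an illustration under stated assumptions, not the paper's exact formulation: the abstract does not give the objective, so the function name `ggl_objective`, the parameters `lam` and `prior`, the `1 - prior` weighting, and the simplification to one scalar coupling per residue pair (with all families already aligned to the same length) are all hypothetical.

```python
import numpy as np


def ggl_objective(precisions, covariances, lam, prior=None):
    """Penalized negative log-likelihood for K Gaussian graphical models.

    precisions, covariances: lists of K symmetric (L x L) arrays, one pair
        per protein family (assumed aligned to a common length L).
    lam: non-negative penalty strength (hypothetical name).
    prior: optional (L x L) array of predicted contact probabilities in
        [0, 1]; a higher probability lowers the penalty on that pair
        (an assumed weighting scheme, chosen for illustration).
    """
    nll = 0.0
    for theta, s in zip(precisions, covariances):
        try:
            # Cholesky succeeds only for positive definite matrices.
            chol = np.linalg.cholesky(theta)
        except np.linalg.LinAlgError:
            return np.inf
        logdet = 2.0 * np.log(np.diag(chol)).sum()
        # Gaussian negative log-likelihood up to constants and scaling.
        nll += np.trace(s @ theta) - logdet

    stacked = np.stack(precisions)  # shape (K, L, L)
    # L2 norm across families for each residue pair: the "group" in
    # group lasso, which pushes families toward a shared sparsity pattern.
    group_norm = np.sqrt((stacked ** 2).sum(axis=0))
    weights = np.ones_like(group_norm) if prior is None else 1.0 - prior
    off_diag = ~np.eye(group_norm.shape[0], dtype=bool)
    penalty = lam * (weights * group_norm)[off_diag].sum()
    return nll + penalty


if __name__ == "__main__":
    theta = np.array([[1.0, 0.5], [0.5, 1.0]])
    cov = np.eye(2)
    print(ggl_objective([theta, theta], [cov, cov], lam=0.1))
```

Because the penalty acts on the norm taken across families, a residue pair is either coupled in all families or shrunk toward zero in all of them, which is one way to encode the constraint that related families share similar co-evolution patterns. A solver would minimize this objective over the precision matrices; the sketch only evaluates it.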


