DeepQA: Improving the estimation of single protein model quality with deep belief networks

07/15/2016
by   Renzhi Cao, et al.
0

Protein quality assessment (QA) by ranking and selecting protein models has long been viewed as one of the major challenges for protein tertiary structure prediction. Especially, estimating the quality of a single protein model, which is important for selecting a few good models out of a large model pool consisting of mostly low-quality models, is still a largely unsolved problem. We introduce a novel single-model quality assessment method DeepQA based on deep belief network that utilizes a number of selected features describing the quality of a model from different perspectives, such as energy, physio-chemical characteristics, and structural information. The deep belief network is trained on several large datasets consisting of models from the Critical Assessment of Protein Structure Prediction (CASP) experiments, several publicly available datasets, and models generated by our in-house ab initio method. Our experiment demonstrate that deep belief network has better performance compared to Support Vector Machines and Neural Networks on the protein model quality assessment problem, and our method DeepQA achieves the state-of-the-art performance on CASP11 dataset. It also outperformed two well-established methods in selecting good outlier models from a large set of models of mostly low quality generated by ab initio modeling methods. DeepQA is a useful tool for protein single model quality assessment and protein structure prediction. The source code, executable, document and training/test datasets of DeepQA for Linux is freely available to non-commercial users at http://cactus.rnet.missouri.edu/DeepQA/.

READ FULL TEXT

page 13

page 14

research
02/13/2016

Evaluation of Protein Structural Models Using Random Forests

Protein structure prediction has been a grand challenge problem in the s...
research
11/27/2020

Protein model quality assessment using rotation-equivariant, hierarchical neural networks

Proteins are miniature machines whose function depends on their three-di...
research
12/28/2019

Energy-based Graph Convolutional Networks for Scoring Protein Docking Models

Structural information about protein-protein interactions, often missing...
research
10/03/2020

Decoy Selection for Protein Structure Prediction Via Extreme Gradient Boosting and Ranking

Identifying one or more biologically-active/native decoys from millions ...
research
11/16/2020

Spherical convolutions on molecular graphs for protein model quality assessment

Processing information on 3D objects requires methods stable to rigid-bo...
research
05/11/2021

EBM-Fold: Fully-Differentiable Protein Folding Powered by Energy-based Models

Accurate protein structure prediction from amino-acid sequences is criti...
research
09/16/2021

PDBench: Evaluating Computational Methods for Protein Sequence Design

Proteins perform critical processes in all living systems: converting so...

Please sign up or login with your details

Forgot password? Click here to reset