Protein Conformational States: A First Principles Bayesian Method

08/05/2020
by David M. Rogers, et al.

Automated identification of protein conformational states from simulation of an ensemble of structures is a hard problem because it requires teaching a computer to recognize shapes. We adapt the naive Bayes classifier from the machine learning community for use on atom-to-atom pairwise contacts. The result is an unsupervised learning algorithm that samples a "distribution" over potential classification schemes. We apply the classifier to a series of test structures and one real protein, showing that it identifies the conformational transition with > 95% accuracy in most cases. A nontrivial feature of our adaptation is a new connection to information entropy that allows us to vary the level of structural detail without spoiling the categorization. This is confirmed by comparing results as the number of atoms and time-samples are varied over 1.5 orders of magnitude. Further, the method's derivation from Bayesian analysis on the set of inter-atomic contacts makes it easy to understand and extend to more complex cases.
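To make the idea concrete, here is a minimal sketch of unsupervised naive Bayes classification on binary pairwise contacts. It is not the authors' algorithm: it is a generic Bernoulli mixture model sampled with Gibbs sampling, and the function names, the distance cutoff, the uniform Beta and Dirichlet priors, and the fixed number of states `K` are all assumptions made for illustration. Each sweep yields one candidate assignment of frames to states, so the collection of sweeps is a sample over classification schemes.

```python
import numpy as np

def contact_matrix(coords, cutoff=1.5):
    """Binary contacts per frame (hypothetical helper).

    coords: array of shape (T, N, 3), T frames of N atoms.
    Returns an int8 array of shape (T, N*(N-1)/2), one column per atom pair.
    """
    diff = coords[:, :, None, :] - coords[:, None, :, :]
    dist = np.linalg.norm(diff, axis=-1)
    iu = np.triu_indices(coords.shape[1], k=1)  # unique pairs i < j
    return (dist[:, iu[0], iu[1]] < cutoff).astype(np.int8)

def gibbs_bernoulli_mixture(X, K=2, n_sweeps=50, seed=0):
    """Gibbs sampler for a K-state Bernoulli mixture (naive Bayes) model.

    X: binary array of shape (T, M), frames by contacts.
    Returns an array of shape (n_sweeps, T) of sampled state labels.
    """
    rng = np.random.default_rng(seed)
    T, M = X.shape
    z = rng.integers(K, size=T)  # random initial labels
    samples = []
    for _ in range(n_sweeps):
        onehot = np.eye(K)[z]        # (T, K) membership indicators
        counts = onehot.sum(axis=0)  # frames per state
        ones = onehot.T @ X          # (K, M) formed-contact counts per state
        # Contact probabilities from their Beta posterior (uniform prior).
        theta = rng.beta(1 + ones, 1 + counts[:, None] - ones)
        theta = np.clip(theta, 1e-9, 1 - 1e-9)  # keep the logs finite
        # State weights from their Dirichlet posterior (uniform prior).
        pi = rng.dirichlet(1 + counts)
        # Naive Bayes: contacts are independent given the state.
        logp = X @ np.log(theta).T + (1 - X) @ np.log1p(-theta).T + np.log(pi)
        logp -= logp.max(axis=1, keepdims=True)
        p = np.exp(logp)
        p /= p.sum(axis=1, keepdims=True)
        # Draw a new label for every frame by inverting the cumulative sum.
        u = rng.random(T)
        z = np.minimum((p.cumsum(axis=1) < u[:, None]).sum(axis=1), K - 1)
        samples.append(z.copy())
    return np.array(samples)
```

In use, one would build `X = contact_matrix(coords)` from a trajectory and inspect how often pairs of frames share a label across the later sweeps. Labels are only defined up to permutation, so comparisons against a reference assignment need to account for that.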

research
05/31/2022

Contrastive Representation Learning for 3D Protein Structures

Learning from 3D protein structures has gained wide interest in protein ...
research
05/31/2018

Conformation Clustering of Long MD Protein Dynamics with an Adversarial Autoencoder

Recent developments in specialized computer hardware have greatly accele...
research
10/07/2019

Weighted graphlets and deep neural networks for protein structure classification

As proteins with similar structures often have similar functions, analys...
research
10/25/2022

Estimating Boltzmann Averages for Protein Structural Quantities Using Sequential Monte Carlo

Sequential Monte Carlo (SMC) methods are widely used to draw samples fro...
research
06/08/2022

Diffusion probabilistic modeling of protein backbones in 3D for the motif-scaffolding problem

Construction of a scaffold structure that supports a desired motif, conf...
research
10/04/2016

A novel and effective scoring scheme for structure classification and pairwise similarity measurement

Protein tertiary structure defines its functions, classification and bin...
research
12/20/2017

Unsupervised learning of dynamical and molecular similarity using variance minimization

In this report, we present an unsupervised machine learning method for d...
