Deep learning based mixed-dimensional GMM for characterizing variability in CryoEM

01/25/2021
by Muyuan Chen, et al.

The function of most protein molecules involves structural flexibility and/or dynamic interactions with other molecules. CryoEM provides direct visualization of individual macromolecules in different conformational and compositional states. While many methods are available for classifying discrete states, characterizing continuous conformational changes or large numbers of discrete states without human supervision remains challenging. Here we present a machine learning algorithm that determines a conformational landscape for proteins or complexes using a 3-D Gaussian mixture model mapped onto 2-D particle images in known orientations. Using a deep neural network architecture, this method can automatically resolve the structural heterogeneity within a protein complex and map particles onto a small latent space describing conformational and compositional changes. This system presents a more intuitive and flexible representation than other manifold methods currently in use. We demonstrate the method on several different biomolecular systems to explore compositional and conformational changes at a range of scales.
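The core geometric idea in the abstract, a 3-D Gaussian mixture model mapped onto a 2-D particle image in a known orientation, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `project_gmm`, its parameters, the restriction to isotropic Gaussians, and the choice of the z axis as the projection direction are all assumptions made for illustration, and the neural network that maps particles to the latent space is not shown.

```python
import numpy as np

def project_gmm(centers, amplitudes, sigmas, rotation, box_size):
    """Project isotropic 3-D Gaussians along z onto a 2-D image (hypothetical helper).

    centers:    (N, 3) Gaussian centers in pixels, origin at the box center
    amplitudes: (N,) peak heights of the 3-D Gaussians
    sigmas:     (N,) standard deviations in pixels
    rotation:   (3, 3) rotation matrix giving the particle orientation
    """
    # Rotate the model into the particle's orientation.
    rotated = centers @ rotation.T
    # Integrating an isotropic 3-D Gaussian along z leaves a 2-D Gaussian
    # centered at (x, y), scaled by sigma * sqrt(2 * pi).
    xy = rotated[:, :2]
    coords = np.arange(box_size) - box_size / 2.0
    gx, gy = np.meshgrid(coords, coords, indexing="xy")  # image[row=y, col=x]
    image = np.zeros((box_size, box_size))
    for (cx, cy), amp, sig in zip(xy, amplitudes, sigmas):
        r2 = (gx - cx) ** 2 + (gy - cy) ** 2
        image += amp * sig * np.sqrt(2.0 * np.pi) * np.exp(-r2 / (2.0 * sig ** 2))
    return image

# Usage: one Gaussian, one pixel off-center along x, viewed without rotation.
demo = project_gmm(
    centers=np.array([[1.0, 0.0, 0.0]]),
    amplitudes=np.array([1.0]),
    sigmas=np.array([1.0]),
    rotation=np.eye(3),
    box_size=8,
)
```

Because every pixel value is a smooth function of the Gaussian centers, amplitudes, and widths, a projection of this kind can in principle be compared with an experimental particle image and its parameters adjusted by gradient-based optimization.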


research
05/20/2022

Learning Geometrically Disentangled Representations of Protein Folding Simulations

Massive molecular simulations of drug-target proteins have been used as ...
research
08/27/2021

Variational embedding of protein folding simulations using gaussian mixture variational autoencoders

Conformational sampling of biomolecules using molecular dynamics simulat...
research
07/26/2017

Prediction of amino acid side chain conformation using a deep neural network

A deep neural network based architecture was constructed to predict amin...
research
11/25/2022

Latent Space Diffusion Models of Cryo-EM Structures

Cryo-electron microscopy (cryo-EM) is unique among tools in structural b...
research
05/18/2021

Conformational variability of loops in the SARS-CoV-2 spike protein

The SARS-CoV-2 spike (S) protein facilitates viral infection, and has be...
research
04/30/2020

On the Spontaneous Emergence of Discrete and Compositional Signals

We propose a general framework to study language emergence through signa...
research
03/18/2020

Site2Vec: a reference frame invariant algorithm for vector embedding of protein-ligand binding sites

Protein-ligand interactions are one of the fundamental types of molecula...
