Protein structure generation via folding diffusion

09/30/2022
by   Kevin E. Wu, et al.
13

The ability to computationally generate novel yet physically foldable protein structures could lead to new biological discoveries and new treatments targeting yet incurable diseases. Despite recent advances in protein structure prediction, directly generating diverse, novel protein structures from neural networks remains difficult. In this work, we present a new diffusion-based generative model that designs protein backbone structures via a procedure that mirrors the native folding process. We describe protein backbone structure as a series of consecutive angles capturing the relative orientation of the constituent amino acid residues, and generate new structures by denoising from a random, unfolded state towards a stable folded structure. Not only does this mirror how proteins biologically twist into energetically favorable conformations, the inherent shift and rotational invariance of this representation crucially alleviates the need for complex equivariant networks. We train a denoising diffusion probabilistic model with a simple transformer backbone and demonstrate that our resulting model unconditionally generates highly realistic protein structures with complexity and structural patterns akin to those of naturally-occurring proteins. As a useful resource, we release the first open-source codebase and trained models for protein structure diffusion.

READ FULL TEXT

page 4

page 8

page 9

page 14

page 16

page 19

page 20

page 21

research
01/29/2023

Generating Novel, Designable, and Diverse Protein Structures by Equivariantly Diffusing Oriented Residue Clouds

Proteins power a vast array of functional processes in living cells. The...
research
02/05/2023

SE(3) diffusion model with application to protein backbone generation

The design of novel protein structures remains a challenge in protein en...
research
04/05/2023

EigenFold: Generative Protein Structure Prediction with Diffusion Models

Protein structure prediction has reached revolutionary levels of accurac...
research
06/08/2022

Diffusion probabilistic modeling of protein backbones in 3D for the motif-scaffolding problem

Construction of a scaffold structure that supports a desired motif, conf...
research
05/26/2022

Protein Structure and Sequence Generation with Equivariant Denoising Diffusion Probabilistic Models

Proteins are macromolecules that mediate a significant fraction of the c...
research
07/23/2023

DiAMoNDBack: Diffusion-denoising Autoregressive Model for Non-Deterministic Backmapping of Cα Protein Traces

Coarse-grained molecular models of proteins permit access to length and ...
research
11/09/2019

Accurate Protein Structure Prediction by Embeddings and Deep Learning Representations

Proteins are the major building blocks of life, and actuators of almost ...

Please sign up or login with your details

Forgot password? Click here to reset