Diffusion probabilistic modeling of protein backbones in 3D for the motif-scaffolding problem

06/08/2022
by   Brian L Trippe, et al.
15

Construction of a scaffold structure that supports a desired motif, conferring protein function, shows promise for the design of vaccines and enzymes. But a general solution to this motif-scaffolding problem remains open. Current machine-learning techniques for scaffold design are either limited to unrealistically small scaffolds (up to length 20) or struggle to produce multiple diverse scaffolds. We propose to learn a distribution over diverse and longer protein backbone structures via an E(3)-equivariant graph neural network. We develop SMCDiff to efficiently sample scaffolds from this distribution conditioned on a given motif; our algorithm is the first to theoretically guarantee conditional samples from a diffusion model in the large-compute limit. We evaluate our designed backbones by how well they align with AlphaFold2-predicted structures. We show that our method can (1) sample scaffolds up to 80 residues and (2) achieve structurally diverse scaffolds for a fixed motif.

READ FULL TEXT
research
05/06/2023

A Latent Diffusion Model for Protein Structure Generation

Proteins are complex biomolecules that perform a variety of crucial func...
research
09/30/2022

Protein structure generation via folding diffusion

The ability to computationally generate novel yet physically foldable pr...
research
01/29/2023

Generating Novel, Designable, and Diverse Protein Structures by Equivariantly Diffusing Oriented Residue Clouds

Proteins power a vast array of functional processes in living cells. The...
research
08/24/2021

Stationarity and inference in multistate promoter models of stochastic gene expression via stick-breaking measures

In a general stochastic multistate promoter model of dynamic mRNA/protei...
research
10/24/2022

Structure-based Drug Design with Equivariant Diffusion Models

Structure-based drug design (SBDD) aims to design small-molecule ligands...
research
08/05/2020

Protein Conformational States: A First Principles Bayesian Method

Automated identification of protein conformational states from simulatio...
research
02/05/2023

SE(3) diffusion model with application to protein backbone generation

The design of novel protein structures remains a challenge in protein en...

Please sign up or login with your details

Forgot password? Click here to reset