AlphaDesign: A graph protein design method and benchmark on AlphaFoldDB

02/01/2022
by Zhangyang Gao, et al.

While DeepMind has tentatively solved protein folding, its inverse problem, protein design, which predicts protein sequences from their 3D structures, still faces significant challenges. In particular, the lack of a large-scale standardized benchmark and poor accuracy hinder research progress. To standardize comparisons and draw more research interest, we use AlphaFold DB, one of the world's largest protein structure databases, to establish a new graph-based benchmark, AlphaDesign. Based on AlphaDesign, we propose a new method called ADesign that improves accuracy by introducing protein angles as new features, using a simplified graph transformer encoder (SGT), and proposing a confidence-aware protein decoder (CPD). SGT and CPD also improve model efficiency by simplifying the training and testing procedures. Experiments show that ADesign significantly outperforms previous graph models: for example, the average accuracy is improved by 8%, and inference is more than 40 times faster than before.
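The abstract does not say which protein angles ADesign uses as features. As a rough illustration only, the sketch below computes a dihedral (torsion) angle from four 3D points, which is one common kind of angle derived from backbone atom coordinates. The function name `dihedral` and its helpers are hypothetical and are not taken from the paper or its code.

```python
import math

# Illustrative sketch only: the abstract does not specify ADesign's angle features.
# A dihedral angle over four consecutive points is ASSUMED here as one plausible example.

def _sub(a, b):
    """Component-wise difference a - b of two 3D points."""
    return tuple(x - y for x, y in zip(a, b))

def _dot(a, b):
    """Dot product of two 3D vectors."""
    return sum(x * y for x, y in zip(a, b))

def _cross(a, b):
    """Cross product of two 3D vectors."""
    return (
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    )

def dihedral(p0, p1, p2, p3):
    """Return the dihedral angle in radians defined by four 3D points.

    The angle is measured around the p1 -> p2 axis, between the plane
    containing (p0, p1, p2) and the plane containing (p1, p2, p3).
    """
    b0 = _sub(p0, p1)
    b1 = _sub(p2, p1)
    b2 = _sub(p3, p2)

    # Normalize the central bond so the projections below are well defined.
    norm = math.sqrt(_dot(b1, b1))
    b1 = tuple(x / norm for x in b1)

    # Remove the components of b0 and b2 that lie along the central bond.
    v = _sub(b0, tuple(_dot(b0, b1) * x for x in b1))
    w = _sub(b2, tuple(_dot(b2, b1) * x for x in b1))

    # atan2 gives a signed angle in (-pi, pi].
    x = _dot(v, w)
    y = _dot(_cross(b1, v), w)
    return math.atan2(y, x)
```

In a graph-based model, angles like this would typically be computed per residue and passed to the encoder as node features, often encoded as sine and cosine pairs to avoid the discontinuity at ±π. Whether ADesign does exactly this is not stated in the abstract.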


research
02/07/2021

Mimetic Neural Networks: A unified framework for Protein Design and Folding

Recent advancements in machine learning techniques for protein folding m...
research
01/25/2016

PGR: A Graph Repository of Protein 3D-Structures

Graph theory and graph mining constitute rich fields of computational te...
research
10/16/2021

Geometric Transformers for Protein Interface Contact Prediction

Computational methods for predicting the interface contacts between prot...
research
04/27/2022

TERMinator: A Neural Framework for Structure-Based Protein Design using Tertiary Repeating Motifs

Computational protein design has the potential to deliver novel molecula...
research
08/24/2021

Stationarity and inference in multistate promoter models of stochastic gene expression via stick-breaking measures

In a general stochastic multistate promoter model of dynamic mRNA/protei...
research
07/13/2020

ProteiNN: Intrinsic-Extrinsic Convolution and Pooling for Scalable Deep Protein Analysis

Proteins perform a large variety of functions in living organisms, thus ...
research
06/08/2023

Heterogeneity-aware integrative analyses for ancestry-specific association studies

Ancestry-specific proteome-wide association studies (PWAS) based on gene...
