Predicting protein stability changes under multiple amino acid substitutions using equivariant graph neural networks

05/30/2023
by   Sebastien Boyer, et al.

The accurate prediction of changes in protein stability under multiple amino acid substitutions is essential for realising true in-silico protein re-design. To this end, we propose improvements to state-of-the-art deep learning (DL) protein stability prediction models that decouple the atomic and residue scales of protein representations, enabling first-of-a-kind predictions on structural representations for variable numbers of amino acid substitutions. This was achieved using E(3)-equivariant graph neural networks (EGNNs) for both the atomic environment (AE) embedding and the residue-level scoring tasks. Our AE embedder was used to featurise a residue-level graph, on which a second EGNN was then trained to score mutant stability (ΔΔG). To train this predictive EGNN effectively, we leveraged the unprecedented scale of a new high-throughput experimental protein stability dataset, Mega-scale. Finally, we demonstrate the immediately promising results of this procedure, discuss its current shortcomings, and highlight potential future strategies.
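As an illustration of the residue-level scoring idea, the sketch below implements a single generic EGNN-style message-passing layer in NumPy, followed by a sum-pooled scalar readout. It is not the authors' architecture: the fully connected graph, the single-layer `tanh` maps, the random weights, and the names `init_params`, `egnn_layer`, and `score` are all assumptions made for this example. Here `h` stands in for per-residue features (such as AE embeddings) and `x` for residue coordinates. The point it shows is that a score built this way does not change when the structure is rotated or translated.

```python
import numpy as np

def init_params(feat_dim, hidden_dim, rng):
    """Random weights for one illustrative layer plus a linear readout (hypothetical)."""
    s = 0.1
    return {
        "W_e": rng.normal(0, s, (2 * feat_dim + 1, hidden_dim)),  # edge/message map
        "W_x": rng.normal(0, s, (hidden_dim, 1)),                 # coordinate gate
        "W_h": rng.normal(0, s, (feat_dim + hidden_dim, feat_dim)),  # node update
        "w_out": rng.normal(0, s, (feat_dim,)),                   # scalar readout
    }

def egnn_layer(h, x, params):
    """One EGNN-style update on a fully connected graph.

    h: (n, feat_dim) invariant node features; x: (n, 3) coordinates.
    Returns updated (h, x). Features stay invariant, coordinates stay equivariant.
    """
    n = h.shape[0]
    diff = x[:, None, :] - x[None, :, :]               # (n, n, 3) relative positions
    dist2 = np.sum(diff ** 2, axis=-1, keepdims=True)  # (n, n, 1) invariant distances
    h_i = np.repeat(h[:, None, :], n, axis=1)
    h_j = np.repeat(h[None, :, :], n, axis=0)
    # Messages depend only on invariant quantities: h_i, h_j and squared distance.
    m = np.tanh(np.concatenate([h_i, h_j, dist2], axis=-1) @ params["W_e"])
    m = m * (1.0 - np.eye(n)[:, :, None])              # drop self-messages
    # Coordinates move along relative vectors, scaled by an invariant gate.
    x_new = x + np.sum(diff * (m @ params["W_x"]), axis=1) / max(n - 1, 1)
    # Residual feature update from aggregated messages.
    agg = m.sum(axis=1)
    h_new = h + np.tanh(np.concatenate([h, agg], axis=-1) @ params["W_h"])
    return h_new, x_new

def score(h, x, params):
    """Sum-pool updated node features into one scalar (a stand-in for a stability score)."""
    h_out, _ = egnn_layer(h, x, params)
    return float(h_out.sum(axis=0) @ params["w_out"])

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    params = init_params(feat_dim=4, hidden_dim=8, rng=rng)
    h = rng.normal(size=(5, 4))   # toy features for 5 residues
    x = rng.normal(size=(5, 3))   # toy coordinates
    Q, _ = np.linalg.qr(rng.normal(size=(3, 3)))  # random orthogonal matrix
    t = rng.normal(size=(3,))                     # random translation
    print(np.isclose(score(h, x, params), score(h, x @ Q.T + t, params)))
```

Because the messages only see squared distances and the coordinate update only adds multiples of relative position vectors, the readout is invariant under rotations, reflections, and translations of `x`, which is the property that motivates using EGNNs on structural inputs.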


research
02/08/2018

mGPfusion: Predicting protein stability changes with Gaussian process kernel learning and data fusion

Proteins are commonly used by biochemical industry for numerous processe...
research
04/27/2020

Energy-based models for atomic-resolution protein conformations

We propose an energy-based model (EBM) of protein conformations that ope...
research
11/08/2021

AI challenges for predicting the impact of mutations on protein stability

Stability is a key ingredient of protein fitness and its modification th...
research
06/21/2023

Predicting protein variants with equivariant graph neural networks

Pre-trained models have been successful in many protein engineering task...
research
07/24/2020

Hierachial Protein Function Prediction with Tails-GNNs

Protein function prediction may be framed as predicting subgraphs (with ...
research
07/21/2021

Structure-aware Interactive Graph Neural Networks for the Prediction of Protein-Ligand Binding Affinity

Drug discovery often relies on the successful prediction of protein-liga...
research
02/08/2022

ECRECer: Enzyme Commission Number Recommendation and Benchmarking based on Multiagent Dual-core Learning

Enzyme Commission (EC) numbers, which associate a protein sequence with ...
