Protein Folding Neural Networks Are Not Robust

09/09/2021
by   Sumit Kumar Jha, et al.
0

Deep neural networks such as AlphaFold and RoseTTAFold predict remarkably accurate structures of proteins compared to other algorithmic approaches. It is known that biologically small perturbations in the protein sequence do not lead to drastic changes in the protein structure. In this paper, we demonstrate that RoseTTAFold does not exhibit such a robustness despite its high accuracy, and biologically small perturbations for some input sequences result in radically different predicted protein structures. This raises the challenge of detecting when these predicted protein structures cannot be trusted. We define the robustness measure for the predicted structure of a protein sequence to be the inverse of the root-mean-square distance (RMSD) in the predicted structure and the structure of its adversarially perturbed sequence. We use adversarial attack methods to create adversarial protein sequences, and show that the RMSD in the predicted protein structure ranges from 0.119Å to 34.162Å when the adversarial perturbations are bounded by 20 units in the BLOSUM62 distance. This demonstrates very high variance in the robustness measure of the predicted structures. We show that the magnitude of the correlation (0.917) between our robustness measure and the RMSD between the predicted structure and the ground truth is high, that is, the predictions with low robustness measure cannot be trusted. This is the first paper demonstrating the susceptibility of RoseTTAFold to adversarial attacks.

READ FULL TEXT
research
01/10/2023

On the Robustness of AlphaFold: A COVID-19 Case Study

Protein folding neural networks (PFNNs) such as AlphaFold predict remark...
research
05/15/2023

AF2-Mutation: Adversarial Sequence Mutations against AlphaFold2 on Protein Tertiary Structure Prediction

Deep learning-based approaches, such as AlphaFold2 (AF2), have significa...
research
10/07/2019

Weighted graphlets and deep neural networks for protein structure classification

As proteins with similar structures often have similar functions, analys...
research
05/26/2022

DRLComplex: Reconstruction of protein quaternary structures using deep reinforcement learning

Predicted inter-chain residue-residue contacts can be used to build the ...
research
06/17/2018

MCP: a multi-component learning machine for prediction of protein secondary structure

Proteins biological function is tightly connected to its specific 3D str...
research
09/02/2014

CoMOGrad and PHOG: From Computer Vision to Fast and Accurate Protein Tertiary Structure Retrieval

Due to the advancements in technology number of entries in the structura...
research
05/11/2021

EBM-Fold: Fully-Differentiable Protein Folding Powered by Energy-based Models

Accurate protein structure prediction from amino-acid sequences is criti...

Please sign up or login with your details

Forgot password? Click here to reset