Generalizable Protein Interface Prediction with End-to-End Learning

07/03/2018
by   Raphael J. L. Townshend, et al.
0

Predicting how proteins interact with one another - that is, which surfaces of one protein bind to which surfaces of another protein - is a central problem in biology. Here we present Siamese Atomic Surfacelet Network (SASNet), the first end-to-end learning method for protein interface prediction. Despite using only spatial coordinates and identities of atoms as inputs, SASNet outperforms state-of-the-art methods that rely on complex, hand-selected features. These results are particularly striking because we train the method entirely on a significantly biased data set that does not account for the fact that proteins deform when binding to one another. Nonetheless, our network maintains high performance, without retraining, when tested on real cases in which proteins do deform. This suggests that it has learned fundamental properties of protein structure and dynamics, which has important implications for a variety of key problems related to biomolecular structure.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/30/2020

PersGNN: Applying Topological Data Analysis and Geometric Deep Learning to Structure-Based Protein Function Prediction

Understanding protein structure-function relationships is a key challeng...
research
10/16/2021

Geometric Transformers for Protein Interface Contact Prediction

Computational methods for predicting the interface contacts between prot...
research
03/20/2023

FlexVDW: A machine learning approach to account for protein flexibility in ligand docking

Most widely used ligand docking methods assume a rigid protein structure...
research
11/27/2020

Protein model quality assessment using rotation-equivariant, hierarchical neural networks

Proteins are miniature machines whose function depends on their three-di...
research
12/07/2022

Dock2D: Synthetic data for the molecular recognition problem

Predicting the physical interaction of proteins is a cornerstone problem...
research
09/27/2016

Multiple protein feature prediction with statistical relational learning

High throughput sequencing techniques have highly impactedon modern biol...
research
06/08/2023

MC-NN: An End-to-End Multi-Channel Neural Network Approach for Predicting Influenza A Virus Hosts and Antigenic Types

Influenza poses a significant threat to public health, particularly amon...

Please sign up or login with your details

Forgot password? Click here to reset