Boosting Convolutional Neural Networks' Protein Binding Site Prediction Capacity Using SE(3)-invariant transformers, Transfer Learning and Homology-based Augmentation

02/20/2023
by Daeseok Lee, et al.

Identifying small-molecule binding sites in target proteins, at either pocket or residue resolution, is critical in many virtual and real drug-discovery scenarios. Because such binding sites are not always easy to find with domain knowledge or traditional methods, several deep learning methods that predict binding sites from protein structures have been developed in recent years. Here we present a new deep learning algorithm of this kind that significantly outperformed all state-of-the-art baselines at both resolutions, pocket and residue. This performance was also demonstrated in a case study involving human serum albumin and its binding sites. Our algorithm introduces new ideas in both the model architecture and the training method. For the architecture, it incorporates SE(3)-invariant geometric self-attention layers that operate on top of residue-level CNN outputs. This residue-level processing allows transfer learning between the two resolutions, which turned out to significantly improve binding pocket prediction. Moreover, we developed a novel augmentation method based on protein homology, which prevented our model from over-fitting. Overall, we believe that our contribution to the literature is twofold. First, we provide a new computational method for binding site prediction that is relevant to real-world applications, as shown by its good performance on different benchmarks and in a case study. Second, the novel ideas in our method (the model architecture, transfer learning, and homology-based augmentation) can serve as useful components in future work.
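To make the notion of SE(3)-invariance concrete, the following is a minimal sketch, not the authors' architecture. It assumes a toy self-attention layer whose logits combine feature dot products with a penalty on inter-residue distance; because pairwise distances do not change under rotation and translation, the layer's output does not change either. All names here (`distance_biased_attention`, `rigid_transform`, `length_scale`) are hypothetical, and the random feature vectors merely stand in for residue-level CNN outputs.

```python
import math
import random

def pairwise_distances(coords):
    """Euclidean distances between all residue positions (unchanged by rigid motions)."""
    return [[math.dist(a, b) for b in coords] for a in coords]

def softmax(row):
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]

def distance_biased_attention(features, coords, length_scale=8.0):
    """Toy self-attention: scaled dot-product logits minus a distance penalty.

    features: per-residue feature vectors (assumed stand-ins for CNN outputs)
    coords:   per-residue 3D positions
    length_scale: hypothetical scale for the distance penalty
    """
    dists = pairwise_distances(coords)
    dim = len(features[0])
    out = []
    for i, q in enumerate(features):
        logits = [
            sum(a * b for a, b in zip(q, k)) / math.sqrt(dim) - dists[i][j] / length_scale
            for j, k in enumerate(features)
        ]
        weights = softmax(logits)
        out.append([sum(w * v[d] for w, v in zip(weights, features)) for d in range(dim)])
    return out

def rigid_transform(coords, angle, shift):
    """Rotate about the z-axis by `angle` radians, then translate by `shift`.

    A z-axis rotation is only one example of an SE(3) element; it is used
    here to keep the sketch short.
    """
    c, s = math.cos(angle), math.sin(angle)
    return [
        (c * x - s * y + shift[0], s * x + c * y + shift[1], z + shift[2])
        for x, y, z in coords
    ]

def max_abs_difference(a, b):
    return max(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))

rng = random.Random(0)
coords = [tuple(rng.uniform(-10, 10) for _ in range(3)) for _ in range(6)]
features = [[rng.gauss(0, 1) for _ in range(4)] for _ in range(6)]

original = distance_biased_attention(features, coords)
moved = distance_biased_attention(features, rigid_transform(coords, 1.1, (5.0, -3.0, 2.0)))
difference = max_abs_difference(original, moved)  # expected to be at floating-point noise level
```

In this sketch the coordinates enter only through pairwise distances, which is one simple way to obtain invariance; the paper's geometric self-attention layers may use a different construction.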

research
03/18/2020

Site2Vec: a reference frame invariant algorithm for vector embedding of protein-ligand binding sites

Protein-ligand interactions are one of the fundamental types of molecula...
research
02/13/2020

DeepSurf: A surface-based deep learning approach for the prediction of ligand binding sites on proteins

The knowledge of potentially druggable binding sites on proteins is an i...
research
11/06/2018

DeepConv-DTI: Prediction of drug-target interactions via deep learning with convolution on protein sequences

Identification of drug-target interactions (DTIs) plays a key role in dr...
research
07/25/2018

PADME: A Deep Learning-based Framework for Drug-Target Interaction Prediction

In silico Drug-target Interaction (DTI) prediction is an important and c...
research
02/14/2023

Do Deep Learning Models Really Outperform Traditional Approaches in Molecular Docking?

Molecular docking, given a ligand molecule and a ligand binding site (ca...
research
06/20/2018

DeepAffinity: Interpretable Deep Learning of Compound-Protein Affinity through Unified Recurrent and Convolutional Neural Networks

Motivation: Drug discovery demands rapid quantification of compound-prot...
research
05/03/2012

An Evolutionary Approach to Drug-Design Using a Novel Neighbourhood Based Genetic Algorithm

The present work provides a new approach to evolve ligand structures whi...
