APObind: A Dataset of Ligand Unbound Protein Conformations for Machine Learning Applications in De Novo Drug Design

08/23/2021
by Rishal Aggarwal, et al.

Protein-ligand complex structures have been utilised to design benchmark machine learning methods that perform important drug design tasks such as receptor binding site detection, small molecule docking and binding affinity prediction. However, these methods are usually trained only on ligand-bound (holo) conformations of the protein and are therefore not guaranteed to perform well when the protein is in its native unbound (apo) conformation, which is usually the conformation available for a newly identified receptor. A primary reason is that the local structure of the binding site usually changes upon ligand binding. To facilitate solutions to this problem, we propose a dataset called APObind that aims to provide apo conformations of the proteins present in PDBbind, a popular dataset used in drug design. Furthermore, we explore the performance of methods specific to three use cases on this dataset, and in doing so demonstrate the importance of validating them on APObind.

research
10/24/2022

Structure-based Drug Design with Equivariant Diffusion Models

Structure-based drug design (SBDD) aims to design small-molecule ligands...
research
04/13/2019

Detection of protein-ligand binding sites with 3D segmentation

In recent years machine learning (ML) took bio- and cheminformatics fiel...
research
05/03/2012

An Evolutionary Approach to Drug-Design Using Quantam Binary Particle Swarm Optimization Algorithm

The present work provides a new approach to evolve ligand structures whi...
research
01/16/2023

PlasmoFAB: A Benchmark to Foster Machine Learning for Plasmodium falciparum Protein Antigen Candidate Prediction

Motivation: Machine learning methods can be used to support scientific d...
research
05/21/2022

DProQ: A Gated-Graph Transformer for Protein Complex Structure Assessment

Proteins interact to form complexes to carry out essential biological fu...
research
04/05/2020

One-shot screening of potential peptide ligands on HR1 domain in COVID-19 glycosylated spike (S) protein with deep siamese network

The novel coronavirus (2019-nCoV) has been declared to be a new internat...
research
05/03/2012

An Evolutionary Approach to Drug-Design Using a Novel Neighbourhood Based Genetic Algorithm

The present work provides a new approach to evolve ligand structures whi...