Accelerating high-throughput virtual screening through molecular pool-based active learning

12/13/2020
by   David E. Graff, et al.
8

Structure-based virtual screening is an important tool in early stage drug discovery that scores the interactions between a target protein and candidate ligands. As virtual libraries continue to grow (in excess of 10^8 molecules), so too do the resources necessary to conduct exhaustive virtual screening campaigns on these libraries. However, Bayesian optimization techniques can aid in their exploration: a surrogate structure-property relationship model trained on the predicted affinities of a subset of the library can be applied to the remaining library members, allowing the least promising compounds to be excluded from evaluation. In this study, we assess various surrogate model architectures, acquisition functions, and acquisition batch sizes as applied to several protein-ligand docking datasets and observe significant reductions in computational costs, even when using a greedy acquisition strategy; for example, 87.9 a 100M member library. Such model-guided searches mitigate the increasing computational costs of screening increasingly large virtual libraries and can accelerate high-throughput virtual screening campaigns with applications beyond docking.

READ FULL TEXT

page 1

page 7

page 8

page 15

page 35

page 38

page 39

page 41

research
05/03/2022

Self-focusing virtual screening with active design space pruning

High-throughput virtual screening is an indispensable technique utilized...
research
09/20/2023

Large-scale Pretraining Improves Sample Efficiency of Active Learning based Molecule Virtual Screening

Virtual screening of large compound libraries to identify potential hit ...
research
08/31/2022

RAPTOR: Ravenous Throughput Computing

We describe the design, implementation and performance of the RADICAL-Pi...
research
06/06/2017

Parallel and Distributed Thompson Sampling for Large-scale Accelerated Exploration of Chemical Space

Chemical space is so large that brute force searches for new interesting...
research
09/23/2021

Optimal Decision Making in High-Throughput Virtual Screening Pipelines

Effective selection of the potential candidates that meet certain condit...
research
07/17/2023

Towards Automated Design of Riboswitches

Experimental screening and selection pipelines for the discovery of nove...
research
06/13/2021

Protein-Ligand Docking Surrogate Models: A SARS-CoV-2 Benchmark for Deep Learning Accelerated Virtual Screening

We propose a benchmark to study surrogate model accuracy for protein-lig...

Please sign up or login with your details

Forgot password? Click here to reset