Self-focusing virtual screening with active design space pruning

05/03/2022
by   David E. Graff, et al.
3

High-throughput virtual screening is an indispensable technique utilized in the discovery of small molecules. In cases where the library of molecules is exceedingly large, the cost of an exhaustive virtual screen may be prohibitive. Model-guided optimization has been employed to lower these costs through dramatic increases in sample efficiency compared to random selection. However, these techniques introduce new costs to the workflow through the surrogate model training and inference steps. In this study, we propose an extension to the framework of model-guided optimization that mitigates inferences costs using a technique we refer to as design space pruning (DSP), which irreversibly removes poor-performing candidates from consideration. We study the application of DSP to a variety of optimization tasks and observe significant reductions in overhead costs while exhibiting similar performance to the baseline optimization. DSP represents an attractive extension of model-guided optimization that can limit overhead costs in optimization settings where these costs are non-negligible relative to objective costs, such as docking.

READ FULL TEXT

page 1

page 3

page 26

page 30

page 32

page 33

page 35

page 41

research
12/13/2020

Accelerating high-throughput virtual screening through molecular pool-based active learning

Structure-based virtual screening is an important tool in early stage dr...
research
09/20/2023

Large-scale Pretraining Improves Sample Efficiency of Active Learning based Molecule Virtual Screening

Virtual screening of large compound libraries to identify potential hit ...
research
06/13/2021

Protein-Ligand Docking Surrogate Models: A SARS-CoV-2 Benchmark for Deep Learning Accelerated Virtual Screening

We propose a benchmark to study surrogate model accuracy for protein-lig...
research
03/25/2020

Large-scale ligand-based virtual screening for SARS-CoV-2 inhibitors using deep neural networks

Due to the current severe acute respiratory syndrome coronavirus 2 (SARS...
research
01/28/2022

FastFlows: Flow-Based Models for Molecular Graph Generation

We propose a framework using normalizing-flow based models, SELF-Referen...
research
10/09/2020

Model Exploration with Cost-Aware Learning

We present an extension to active learning routines in which non-constan...

Please sign up or login with your details

Forgot password? Click here to reset