Optimal Decision Making in High-Throughput Virtual Screening Pipelines

09/23/2021
by   Hyun-Myung Woo, et al.
1

Effective selection of the potential candidates that meet certain conditions in a tremendously large search space has been one of the major concerns in many real-world applications. In addition to the nearly infinitely large search space, rigorous evaluation of a sample based on the reliable experimental or computational platform is often prohibitively expensive, making the screening problem more challenging. In such a case, constructing a high-throughput screening (HTS) pipeline that pre-sifts the samples expected to be potential candidates through the efficient earlier stages, results in a significant amount of savings in resources. However, to the best of our knowledge, despite many successful applications, no one has studied optimal pipeline design or optimal pipeline operations. In this study, we propose two optimization frameworks, applying to most (if not all) screening campaigns involving experimental or/and computational evaluations, for optimally determining the screening thresholds of an HTS pipeline. We validate the proposed frameworks on both analytic and practical scenarios. In particular, we consider the optimal computational campaign for the long non-coding RNA (lncRNA) classification as a practical example. To accomplish this, we built the high-throughput virtual screening (HTVS) pipeline for classifying the lncRNA. The simulation results demonstrate that the proposed frameworks significantly reduce the effective selection cost per potential candidate and make the HTS pipelines less sensitive to their structural variations. In addition to the validation, we provide insights on constructing a better HTS pipeline based on the simulation results.

READ FULL TEXT

page 2

page 3

page 4

page 6

page 7

page 8

page 12

page 13

research
12/13/2020

Accelerating high-throughput virtual screening through molecular pool-based active learning

Structure-based virtual screening is an important tool in early stage dr...
research
11/30/2021

A Review on Parallel Virtual Screening Softwares for High Performance Computers

Drug discovery is the most expensive, time demanding and challenging pro...
research
04/09/2021

High-Throughput Virtual Screening of Small Molecule Inhibitors for SARS-CoV-2 Protein Targets with Deep Fusion Models

Structure-based Deep Fusion models were recently shown to outperform sev...
research
07/17/2023

Towards Automated Design of Riboswitches

Experimental screening and selection pipelines for the discovery of nove...
research
07/03/2018

Segmented correspondence curve regression model for quantifying reproducibility of high-throughput experiments

The reliability of a high-throughput biological experiment relies highly...
research
02/14/2022

Machine Learning-Aided Discovery of Superionic Solid-State Electrolyte for Li-Ion Batteries

Li-Ion Solid-State Electrolytes (Li-SSEs) are a promising solution that ...
research
02/02/2022

Improving Screening Processes via Calibrated Subset Selection

Many selection processes such as finding patients qualifying for a medic...

Please sign up or login with your details

Forgot password? Click here to reset