Bayes Optimal Informer Sets for Early-Stage Drug Discovery

11/11/2020
by   Peng Yu, et al.
0

An important experimental design problem in early-stage drug discovery is how to prioritize available compounds for testing when very little is known about the target protein. Informer based ranking (IBR) methods address the prioritization problem when the compounds have provided bioactivity data on other potentially relevant targets. An IBR method selects an informer set of compounds, and then prioritizes the remaining compounds on the basis of new bioactivity experiments performed with the informer set on the target. We formalize the problem as a two-stage decision problem and introduce the Bayes Optimal Informer SEt (BOISE) method for its solution. BOISE leverages a flexible model of the initial bioactivity data, a relevant loss function, and effective computational schemes to resolve the two-step design problem. We evaluate BOISE and compare it to other IBR strategies in two retrospective studies, one on protein-kinase inhibition and the other on anti-cancer drug sensitivity. In both empirical settings BOISE exhibits better predictive performance than available methods. It also behaves well with missing data, where methods that use matrix completion show worse predictive performance. We provide an R implementation of BOISE at https://github.com/wiscstatman/esdd/BOISE

READ FULL TEXT
POST COMMENT

Comments

There are no comments yet.

Authors

page 3

page 9

page 10

page 11

page 12

page 13

09/25/2020

GEFA: Early Fusion Approach in Drug-Target Affinity Prediction

Predicting the interaction between a compound and a target is crucial fo...
08/18/2018

Network-based Biased Tree Ensembles (NetBiTE) for Drug Sensitivity Prediction and Drug Sensitivity Biomarker Identification in Cancer

We present the Network-based Biased Tree Ensembles (NetBiTE) method for ...
07/09/2020

Identifying efficient controls of complex interaction networks using genetic algorithms

Control theory has seen recently impactful applications in network scien...
10/22/2021

GeneDisco: A Benchmark for Experimental Design in Drug Discovery

In vitro cellular experimentation with genetic interventions, using for ...
10/07/2020

Combination of digital signal processing and assembled predictive models facilitates the rational design of proteins

Predicting the effect of mutations in proteins is one of the most critic...
06/08/2018

Black Box FDR

Analyzing large-scale, multi-experiment studies requires scientists to t...
09/17/2021

Proteome-informed machine learning studies of cocaine addiction

Cocaine addiction accounts for a large portion of substance use disorder...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.