Targeted active learning for probabilistic models

10/21/2022
by Christopher Tosh, et al.

A fundamental task in science is to design experiments that yield valuable insights about the system under study. Mathematically, these insights can be represented as a utility or risk function that shapes the value of conducting each experiment. We present PDBAL, a targeted active learning method that adaptively designs experiments to maximize scientific utility. PDBAL takes a user-specified risk function and combines it with a probabilistic model of the experimental outcomes to choose designs that rapidly converge on a high-utility model. We prove theoretical bounds on the label complexity of PDBAL and provide fast closed-form solutions for designing experiments with common exponential family likelihoods. In simulation studies, PDBAL consistently outperforms standard untargeted approaches that focus on maximizing expected information gain over the design space. Finally, we demonstrate the scientific potential of PDBAL through a study on a large cancer drug screen dataset where PDBAL quickly recovers the most efficacious drugs with a small fraction of the total number of experiments.
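
As a rough illustration of the idea in the abstract, the following Python sketch scores each candidate design by how much disagreement, measured by a user-specified distance, is expected to remain between plausible models after the experiment is run. It is not the PDBAL algorithm as published: the finite set of candidate models, the Gaussian noise model, the function names `targeted_design_scores` and `best_design_distance`, and the toy data are all assumptions made for illustration.

```python
import numpy as np


def best_design_distance(models):
    """Hypothetical targeted distance: 1 if two models disagree on the best design, else 0."""
    best = models.argmax(axis=1)
    return (best[:, None] != best[None, :]).astype(float)


def targeted_design_scores(models, weights, distance, noise_sd=1.0, n_sim=200, seed=0):
    """Monte Carlo estimate of the expected post-experiment disagreement per design.

    models:   (M, K) array; row i gives model i's mean outcome for each of K designs
    weights:  (M,) current posterior weights over the M models (sums to 1)
    distance: (M, M) user-specified distance between pairs of models
    """
    rng = np.random.default_rng(seed)
    n_models, n_designs = models.shape
    scores = np.zeros(n_designs)
    for x in range(n_designs):
        total = 0.0
        for _ in range(n_sim):
            # Simulate an outcome from the current posterior predictive (assumed Gaussian noise).
            truth = rng.choice(n_models, p=weights)
            y = rng.normal(models[truth, x], noise_sd)
            # Bayes update of the model weights given the simulated outcome.
            log_lik = -0.5 * ((y - models[:, x]) / noise_sd) ** 2
            post = weights * np.exp(log_lik - log_lik.max())
            post /= post.sum()
            # Expected distance between two models drawn from the updated posterior.
            total += post @ distance @ post
        scores[x] = total / n_sim
    return scores


# Toy example (assumed data): two models that agree on design 0 but differ on design 1.
models = np.array([[1.0, 0.0],
                   [1.0, 5.0]])
weights = np.array([0.5, 0.5])
distance = best_design_distance(models)
scores = targeted_design_scores(models, weights, distance)
chosen = int(scores.argmin())  # design expected to leave the least disagreement
```

In this toy setup both models predict the same outcome for design 0, so observing it cannot change the weights. A criterion of this kind would therefore favor design 1, the only experiment that helps decide which design is best.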

research
04/12/2022

Benchmarking Active Learning Strategies for Materials Optimization and Discovery

Autonomous physical science is revolutionizing materials science. In the...
research
09/06/2022

Modeling and Active Learning for Experiments with Quantitative-Sequence Factors

A new type of experiment that aims to determine the optimal quantities o...
research
03/19/2018

Bayesian design of experiments for intractable likelihood models using coupled auxiliary models and multivariate emulation

A Bayesian design is given by maximising the expected utility over the d...
research
12/29/2021

Active Learning-Based Optimization of Scientific Experimental Design

Active learning (AL) is a machine learning algorithm that can achieve gr...
research
04/09/2015

Deciding when to stop: Efficient stopping of active learning guided drug-target prediction

Active learning has shown to reduce the number of experiments needed to ...
research
01/07/2022

On robust risk-based active-learning algorithms for enhanced decision support

Classification models are a fundamental component of physical-asset mana...
research
09/14/2021

Automatic Reuse, Adaption, and Execution of Simulation Experiments via Provenance Patterns

Simulation experiments are typically conducted repeatedly during the mod...
