Bayesian Optimization for Policy Search via Online-Offline Experimentation

04/01/2019
by Benjamin Letham, et al.
Online field experiments are the gold standard for evaluating changes to real-world interactive machine learning systems. Yet our ability to explore complex, multi-dimensional policy spaces, such as those found in recommendation and ranking problems, is often constrained by the limited number of experiments that can be run simultaneously. To alleviate these constraints, we augment online experiments with an offline simulator and apply multi-task Bayesian optimization to tune live machine learning systems. We describe practical issues that arise in these applications, including biases introduced by the simulator and the assumptions behind the multi-task kernel. We measure empirical learning curves that show substantial gains from including data from biased offline experiments, and show that these learning curves are consistent with theoretical results for multi-task Gaussian process generalization. We find that improved kernel inference is a significant driver of multi-task generalization. Finally, we show several examples of Bayesian optimization efficiently tuning a live machine learning system by combining offline and online experiments.
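
To make the multi-task idea concrete, the following is a minimal sketch of a two-task Gaussian process in which a few "online" observations share information with many "offline" (simulator) observations. It assumes an intrinsic coregionalization (ICM) kernel, which is one common multi-task kernel and is an assumption here, since the abstract does not specify the kernel form. The function names (`rbf`, `icm_kernel`, `mtgp_posterior`), the toy response functions, the task covariance `B`, and all hyperparameter values are hypothetical and chosen only for illustration.

```python
import numpy as np

def rbf(a, b, lengthscale=0.3):
    """Squared-exponential kernel between two 1-D arrays of points."""
    d = a[:, None] - b[None, :]
    return np.exp(-0.5 * (d / lengthscale) ** 2)

def icm_kernel(x1, t1, x2, t2, B, lengthscale=0.3):
    """ICM kernel: K[(x, t), (x', t')] = B[t, t'] * k(x, x')."""
    return B[np.ix_(t1, t2)] * rbf(x1, x2, lengthscale)

def mtgp_posterior(x_train, t_train, y_train, x_test, t_test, B, noise=1e-2):
    """Zero-mean GP posterior mean and variance with fixed hyperparameters."""
    K = icm_kernel(x_train, t_train, x_train, t_train, B)
    K = K + noise * np.eye(len(x_train))
    Ks = icm_kernel(x_test, t_test, x_train, t_train, B)
    Kss = icm_kernel(x_test, t_test, x_test, t_test, B)
    L = np.linalg.cholesky(K)
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y_train))
    mean = Ks @ alpha
    v = np.linalg.solve(L, Ks.T)
    var = np.diag(Kss) - np.sum(v ** 2, axis=0)
    return mean, var

# Task 0 = online experiment, task 1 = offline simulator (assumed labels).
# Assumed task covariance: the two tasks are strongly but not perfectly correlated.
B = np.array([[1.0, 0.9], [0.9, 1.0]])

# A few expensive online observations of a toy response surface.
x_on = np.array([0.1, 0.5, 0.9])
t_on = np.zeros(3, dtype=int)
y_on = np.sin(6 * x_on)

# Many cheap offline observations, offset by a constant to mimic simulator bias.
x_off = np.linspace(0, 1, 20)
t_off = np.ones(20, dtype=int)
y_off = np.sin(6 * x_off) + 0.3

# Predict the online task on a grid, with and without the offline data.
x_test = np.linspace(0, 1, 50)
t_test = np.zeros(50, dtype=int)

_, var_single = mtgp_posterior(x_on, t_on, y_on, x_test, t_test, B)
mean_multi, var_multi = mtgp_posterior(
    np.concatenate([x_on, x_off]),
    np.concatenate([t_on, t_off]),
    np.concatenate([y_on, y_off]),
    x_test, t_test, B,
)
```

With the kernel held fixed, adding the offline observations can only shrink the posterior variance of the online task, so `var_multi` is never larger than `var_single`. The sketch keeps all hyperparameters fixed for brevity; in practice they would be inferred from data, and the abstract identifies improved kernel inference as a significant driver of multi-task generalization.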
