Regret bounds for meta Bayesian optimization with an unknown Gaussian process prior

11/23/2018
by   Zi Wang, et al.
0

Bayesian optimization usually assumes that a Bayesian prior is given. However, the strong theoretical guarantees in Bayesian optimization are often regrettably compromised in practice because of unknown parameters in the prior. In this paper, we adopt a variant of empirical Bayes and show that, by estimating the Gaussian process prior from offline data sampled from the same prior and constructing unbiased estimators of the posterior, variants of both GP-UCB and probability of improvement achieve a near-zero regret bound, which decreases to a constant proportional to the observational noise as the number of offline data and the number of online evaluations increase. Empirically, we have verified our approach on challenging simulated robotic problems featuring task and motion planning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/04/2019

On Batch Bayesian Optimization

We present two algorithms for Bayesian optimization in the batch feedbac...
research
06/14/2022

On Provably Robust Meta-Bayesian Optimization

Bayesian optimization (BO) has become popular for sequential optimizatio...
research
06/02/2021

JUMBO: Scalable Multi-task Bayesian Optimization using Offline Data

The goal of Multi-task Bayesian Optimization (MBO) is to minimize the nu...
research
01/03/2021

Meta-Learning Conjugate Priors for Few-Shot Bayesian Optimization

Bayesian Optimization is methodology used in statistical modelling that ...
research
06/12/2023

Provably Efficient Bayesian Optimization with Unbiased Gaussian Process Hyperparameter Estimation

Gaussian process (GP) based Bayesian optimization (BO) is a powerful met...
research
04/01/2019

Bayesian Optimization for Policy Search via Online-Offline Experimentation

Online field experiments are the gold-standard way of evaluating changes...
research
08/23/2022

Event-Triggered Time-Varying Bayesian Optimization

We consider the problem of sequentially optimizing a time-varying object...

Please sign up or login with your details

Forgot password? Click here to reset