Misspecified Gaussian Process Bandit Optimization

11/09/2021
by Ilija Bogunovic, et al.

We consider the problem of optimizing a black-box function based on noisy bandit feedback. Kernelized bandit algorithms have shown strong empirical and theoretical performance for this problem. However, they rely heavily on the assumption that the model is well-specified and can fail without it. We therefore introduce a misspecified kernelized bandit setting in which the unknown function can be ϵ-uniformly approximated by a function with bounded norm in some Reproducing Kernel Hilbert Space (RKHS). We design efficient and practical algorithms whose performance degrades minimally in the presence of model misspecification. Specifically, we present two algorithms based on Gaussian process (GP) methods: EC-GP-UCB, an optimistic algorithm that requires knowing the misspecification error, and Phased GP Uncertainty Sampling, an elimination-type algorithm that can adapt to unknown model misspecification. We provide upper bounds on their cumulative regret in terms of ϵ, the time horizon, and the underlying kernel, and we show that the latter algorithm achieves optimal dependence on ϵ with no prior knowledge of misspecification. In addition, in a stochastic contextual setting, we show that EC-GP-UCB can be effectively combined with the regret bound balancing strategy to attain similar regret bounds despite not knowing ϵ.
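
To make the optimistic idea concrete, here is a minimal sketch of a GP-UCB loop whose confidence width is enlarged by a term that depends on a known misspecification level. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not give the exact form of the enlargement, so the `eps * sqrt(t)` term, the constant `beta`, the RBF kernel, the finite grid, and all function names (`rbf_kernel`, `gp_posterior`, `enlarged_ucb`, `run`) are hypothetical choices made for this example.

```python
import numpy as np

def rbf_kernel(A, B, lengthscale=0.2):
    # Squared-exponential kernel between the rows of A and the rows of B.
    sq = np.sum(A**2, axis=1)[:, None] + np.sum(B**2, axis=1)[None, :] - 2.0 * A @ B.T
    return np.exp(-0.5 * np.maximum(sq, 0.0) / lengthscale**2)

def gp_posterior(X, y, Xs, noise=0.1, lengthscale=0.2):
    # Standard GP regression: posterior mean and standard deviation at Xs.
    K = rbf_kernel(X, X, lengthscale) + noise**2 * np.eye(len(X))
    Ks = rbf_kernel(X, Xs, lengthscale)
    L = np.linalg.cholesky(K)
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y))
    v = np.linalg.solve(L, Ks)
    mu = Ks.T @ alpha
    var = 1.0 - np.sum(v**2, axis=0)  # prior variance is 1 for this kernel
    return mu, np.sqrt(np.maximum(var, 0.0))

def enlarged_ucb(mu, sigma, beta, eps, t):
    # Usual UCB width `beta`, widened by an eps-dependent term.
    # ASSUMPTION: the eps * sqrt(t) form is illustrative only.
    return mu + (beta + eps * np.sqrt(t)) * sigma

def run(T=30, eps=0.05, beta=2.0, noise=0.1, seed=0):
    rng = np.random.default_rng(seed)
    grid = np.linspace(0.0, 1.0, 101)[:, None]
    smooth = np.sin(6.0 * grid[:, 0])
    # Misspecified target: stays within eps of the smooth function everywhere.
    f = smooth + eps * np.sign(np.sin(40.0 * grid[:, 0]))
    idx = [int(rng.integers(len(grid)))]
    ys = [f[idx[0]] + noise * rng.standard_normal()]
    for t in range(2, T + 1):
        mu, sigma = gp_posterior(grid[idx], np.array(ys), grid, noise)
        nxt = int(np.argmax(enlarged_ucb(mu, sigma, beta, eps, t)))
        idx.append(nxt)
        ys.append(f[nxt] + noise * rng.standard_normal())
    return grid[idx, 0], f[idx], f.max()

if __name__ == "__main__":
    xs, fs, fmax = run()
    print("best queried value:", fs.max(), "grid maximum:", fmax)
```

Setting `eps=0` in this sketch recovers a plain GP-UCB rule, which is one way to compare behaviour with and without the enlarged width on the same toy function.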

research
05/31/2017

Lower Bounds on Regret for Noisy Gaussian Process Bandit Optimization

In this paper, we consider the problem of sequentially optimizing a blac...
research
03/04/2020

Corruption-Tolerant Gaussian Process Bandit Optimization

We consider the problem of optimizing an unknown (typically non-convex) ...
research
02/11/2021

No-Regret Algorithms for Time-Varying Bayesian Optimization

In this paper, we consider the time-varying Bayesian optimization proble...
research
01/28/2020

Bandit optimisation of functions in the Matérn kernel RKHS

We consider the problem of optimising functions in the Reproducing kerne...
research
03/15/2022

Regret Bounds for Expected Improvement Algorithms in Gaussian Process Bandit Optimization

The expected improvement (EI) algorithm is one of the most popular strat...
research
08/20/2020

On Lower Bounds for Standard and Robust Gaussian Process Bandit Optimization

In this paper, we consider algorithm-independent lower bounds for the pr...
research
10/19/2015

Optimization for Gaussian Processes via Chaining

In this paper, we consider the problem of stochastic optimization under ...
