Analysis of Thompson Sampling for Gaussian Process Optimization in the Bandit Setting

05/18/2017
by   Kinjal Basu, et al.
0

We consider the global optimization of a function over a continuous domain. At every evaluation attempt, we can observe the function at a chosen point in the domain and we reap the reward of the value observed. We assume that drawing these observations are expensive and noisy. We frame it as a continuum-armed bandit problem with a Gaussian Process prior on the function. In this regime, most algorithms have been developed to minimize some form of regret. Contrary to this popular norm, in this paper, we study the convergence of the sequential point x^t to the global optimizer x^* for the Thompson Sampling approach. Under some very mild assumptions, we show that the point sequence convergences to the true optimal.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/01/2016

A sequential Monte Carlo approach to Thompson sampling for Bayesian optimization

Bayesian optimization through Gaussian process regression is an effectiv...
research
05/31/2017

Lower Bounds on Regret for Noisy Gaussian Process Bandit Optimization

In this paper, we consider the problem of sequentially optimizing a blac...
research
06/27/2012

Parallelizing Exploration-Exploitation Tradeoffs with Gaussian Process Bandit Optimization

Can one parallelize complex exploration exploitation tradeoffs? As an ex...
research
01/18/2011

Convergence rates of efficient global optimization algorithms

Efficient global optimization is the problem of minimizing an unknown fu...
research
10/27/2020

A Computationally Efficient Approach to Black-box Optimization using Gaussian Process Models

We consider the sequential optimization of an unknown function from nois...
research
03/04/2020

Corruption-Tolerant Gaussian Process Bandit Optimization

We consider the problem of optimizing an unknown (typically non-convex) ...
research
10/29/2020

Gaussian Process Bandit Optimization of the Thermodynamic Variational Objective

Achieving the full promise of the Thermodynamic Variational Objective (T...

Please sign up or login with your details

Forgot password? Click here to reset