An Analysis of Ensemble Sampling

03/02/2022
by   Chao Qin, et al.
0

Ensemble sampling serves as a practical approximation to Thompson sampling when maintaining an exact posterior distribution over model parameters is computationally intractable. In this paper, we establish a Bayesian regret bound that ensures desirable behavior when ensemble sampling is applied to the linear bandit problem. This represents the first rigorous regret analysis of ensemble sampling and is made possible by leveraging information-theoretic concepts and novel analytic techniques that may prove useful beyond the scope of this paper.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/04/2022

Nonstationary Bandit Learning via Predictive Sampling

We propose predictive sampling as an approach to selecting actions that ...
research
05/20/2017

Ensemble Sampling

Thompson sampling has emerged as an effective heuristic for a broad rang...
research
01/24/2020

Ensemble Rejection Sampling

We introduce Ensemble Rejection Sampling, a scheme for exact simulation ...
research
06/17/2020

Analysis and Design of Thompson Sampling for Stochastic Partial Monitoring

We investigate finite stochastic partial monitoring, which is a general ...
research
07/19/2023

VITS : Variational Inference Thomson Sampling for contextual bandits

In this paper, we introduce and analyze a variant of the Thompson sampli...
research
01/06/2022

Gaussian Imagination in Bandit Learning

Assuming distributions are Gaussian often facilitates computations that ...
research
01/15/2021

A Bayesian perspective on sampling of alternatives

In this paper, we apply a Bayesian perspective to sampling of alternativ...

Please sign up or login with your details

Forgot password? Click here to reset