Sub-sampling for Efficient Non-Parametric Bandit Exploration

10/27/2020
by   Dorian Baudry, et al.
0

In this paper we propose the first multi-armed bandit algorithm based on re-sampling that achieves asymptotically optimal regret simultaneously for different families of arms (namely Bernoulli, Gaussian and Poisson distributions). Unlike Thompson Sampling which requires to specify a different prior to be optimal in each case, our proposal RB-SDA does not need any distribution-dependent tuning. RB-SDA belongs to the family of Sub-sampling Duelling Algorithms (SDA) which combines the sub-sampling idea first used by the BESA [1] and SSMC [2] algorithms with different sub-sampling schemes. In particular, RB-SDA uses Random Block sampling. We perform an experimental study assessing the flexibility and robustness of this promising novel approach for exploration in bandit models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/10/2023

A General Recipe for the Analysis of Randomized Multi-Armed Bandit Algorithms

In this paper we propose a general methodology to derive regret bounds f...
research
03/13/2018

Thompson Sampling for Combinatorial Semi-Bandits

We study the application of the Thompson Sampling (TS) methodology to th...
research
06/11/2020

Statistical Efficiency of Thompson Sampling for Combinatorial Semi-Bandits

We investigate stochastic combinatorial multi-armed bandit with semi-ban...
research
08/16/2017

Racing Thompson: an Efficient Algorithm for Thompson Sampling with Non-conjugate Priors

Thompson sampling has impressive empirical performance for many multi-ar...
research
07/01/2015

Bootstrapped Thompson Sampling and Deep Exploration

This technical note presents a new approach to carrying out the kind of ...
research
02/22/2017

Approximations of the Restless Bandit Problem

The multi-armed restless bandit problem is studied in the case where the...
research
05/19/2021

Diffusion Approximations for Thompson Sampling

We study the behavior of Thompson sampling from the perspective of weak ...

Please sign up or login with your details

Forgot password? Click here to reset