Best-arm identification with cascading bandits

11/19/2018
by   Anant Gupta, et al.
0

We consider a variant of the problem of best arm identification in multi-arm bandits, where in each round, multiple arms are played in an ordered fashion until a nonzero reward is obtained. Since each round potentially provides information about more than one arm, the sample complexity can be much lower than in the standard formulation. We introduce a subroutine to perform uniform sampling, that allows us to adapt certain optimal algorithms for the standard version to this variant. As we prove, much of the analysis goes through as well.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/09/2020

Streaming Algorithms for Stochastic Multi-armed Bandits

We study the Stochastic Multi-armed Bandit problem under bounded arm-mem...
research
06/26/2021

The Role of Contextual Information in Best Arm Identification

We study the best-arm identification problem with fixed confidence when ...
research
04/02/2020

Predictive Bandits

We introduce and study a new class of stochastic bandit problems, referr...
research
08/23/2023

On Uniformly Optimal Algorithms for Best Arm Identification in Two-Armed Bandits with Fixed Budget

We study the problem of best-arm identification with fixed budget in sto...
research
06/06/2021

PAC Best Arm Identification Under a Deadline

We study (ϵ, δ)-PAC best arm identification, where a decision-maker must...
research
05/10/2014

Functional Bandits

We introduce the functional bandit problem, where the objective is to fi...
research
11/28/2022

On the Sample Complexity of Representation Learning in Multi-task Bandits with Global and Local structure

We investigate the sample complexity of learning the optimal arm for mul...

Please sign up or login with your details

Forgot password? Click here to reset