Optimal Best Arm Identification with Fixed Confidence

02/15/2016
by   Aurélien Garivier, et al.
0

We give a complete characterization of the complexity of best-arm identification in one-parameter bandit problems. We prove a new, tight lower bound on the sample complexity. We propose the `Track-and-Stop' strategy, which we prove to be asymptotically optimal. It consists in a new sampling rule (which tracks the optimal proportions of arm draws highlighted by the lower bound) and in a stopping rule named after Chernoff, for which we give a new analysis.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/02/2019

Optimal Best Markovian Arm Identification with Fixed Confidence

We give a complete characterization of the sampling complexity of best M...
research
01/25/2022

Almost Optimal Variance-Constrained Best Arm Identification

We design and analyze VA-LUCB, a parameter-free algorithm, for identifyi...
research
05/20/2019

Gradient Ascent for Active Exploration in Bandit Problems

We present a new algorithm based on an gradient ascent for a general Act...
research
10/24/2019

Fixed-Confidence Guarantees for Bayesian Best-Arm Identification

We investigate and provide new insights on the sampling rule called Top-...
research
05/27/2021

A Non-asymptotic Approach to Best-Arm Identification for Gaussian Bandits

We propose a new strategy for best-arm identification with fixed confide...
research
05/20/2020

Best Arm Identification in Spectral Bandits

We study best-arm identification with fixed confidence in bandit models ...
research
02/09/2019

Pure Exploration with Multiple Correct Answers

We determine the sample complexity of pure exploration bandit problems w...

Please sign up or login with your details

Forgot password? Click here to reset