Best Arm Identification in Stochastic Bandits: Beyond β-optimality

01/10/2023
by   Arpan Mukherjee, et al.
0

This paper focuses on best arm identification (BAI) in stochastic multi-armed bandits (MABs) in the fixed-confidence, parametric setting. In such pure exploration problems, the accuracy of the sampling strategy critically hinges on the sequential allocation of the sampling resources among the arms. The existing approaches to BAI address the following question: what is an optimal sampling strategy when we spend a β fraction of the samples on the best arm? These approaches treat β as a tunable parameter and offer efficient algorithms that ensure optimality up to selecting β, hence β-optimality. However, the BAI decisions and performance can be highly sensitive to the choice of β. This paper provides a BAI algorithm that is agnostic to β, dispensing with the need for tuning β, and specifies an optimal allocation strategy, including the optimal value of β. Furthermore, the existing relevant literature focuses on the family of exponential distributions. This paper considers a more general setting of any arbitrary family of distributions parameterized by their mean values (under mild regularity conditions).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/24/2022

Optimality Conditions and Algorithms for Top-K Arm Identification

We consider the top-k arm identification problem for multi-armed bandits...
research
07/02/2020

Gamification of Pure Exploration for Linear Bandits

We investigate an active pure-exploration setting, that includes best-ar...
research
07/22/2022

SPRT-based Efficient Best Arm Identification in Stochastic Bandits

This paper investigates the best arm identification (BAI) problem in sto...
research
08/28/2020

Statistically Robust, Risk-Averse Best Arm Identification in Multi-Armed Bandits

Traditional multi-armed bandit (MAB) formulations usually make certain a...
research
06/13/2022

Top Two Algorithms Revisited

Top Two algorithms arose as an adaptation of Thompson sampling to best a...
research
12/08/2021

Best Arm Identification under Additive Transfer Bandits

We consider a variant of the best arm identification (BAI) problem in mu...
research
05/27/2019

The bias of the sample mean in multi-armed bandits can be positive or negative

It is well known that in stochastic multi-armed bandits (MAB), the sampl...

Please sign up or login with your details

Forgot password? Click here to reset