Optimal Best-arm Identification in Linear Bandits

06/29/2020
by   Yassir Jedra, et al.
0

We study the problem of best-arm identification with fixed confidence in stochastic linear bandits. The objective is to identify the best arm with a given level of certainty while minimizing the sampling budget. We devise a simple algorithm whose sampling complexity matches known instance-specific lower bounds, asymptotically almost surely and in expectation. The algorithm relies on an arm sampling rule that tracks an optimal proportion of arm draws, and that remarkably can be updated as rarely as we wish, without compromising its theoretical guarantees. Moreover, unlike existing best-arm identification strategies, our algorithm uses a stopping rule that does not depend on the number of arms. Experimental results suggest that our algorithm significantly outperforms existing algorithms. The paper further provides a first analysis of the best-arm identification problem in linear bandits with a continuous set of arms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/27/2021

Towards Minimax Optimal Best Arm Identification in Linear Bandits

We study the problem of best arm identification in linear bandits in the...
research
05/20/2019

Best Arm Identification in Generalized Linear Bandits

Motivated by drug design, we consider the best-arm identification proble...
research
05/25/2023

An ε-Best-Arm Identification Algorithm for Fixed-Confidence and Beyond

We propose EB-TCε, a novel sampling rule for ε-best arm identification i...
research
08/23/2023

On Uniformly Optimal Algorithms for Best Arm Identification in Two-Armed Bandits with Fixed Budget

We study the problem of best-arm identification with fixed budget in sto...
research
04/14/2022

Measurement-based Admission Control in Sliced Networks: A Best Arm Identification Approach

In sliced networks, the shared tenancy of slices requires adaptive admis...
research
11/05/2019

Towards Optimal and Efficient Best Arm Identification in Linear Bandits

We give a new algorithm for best arm identification in linearly paramete...
research
02/09/2022

Finding Optimal Arms in Non-stochastic Combinatorial Bandits with Semi-bandit Feedback and Finite Budget

We consider the combinatorial bandits problem with semi-bandit feedback ...

Please sign up or login with your details

Forgot password? Click here to reset