Towards Optimal and Efficient Best Arm Identification in Linear Bandits

11/05/2019
by   Mohammadi Zaki, et al.
0

We give a new algorithm for best arm identification in linearly parameterised bandits in the fixed confidence setting. The algorithm generalises the well-known LUCB algorithm of Kalyanakrishnan et al. (2012) by playing an arm which minimises a suitable notion of geometric overlap of the statistical confidence set for the unknown parameter, and is fully adaptive and computationally efficient as compared to several state-of-the methods. We theoretically analyse the sample complexity of the algorithm for problems with two and three arms, showing optimality in many cases. Numerical results indicate favourable performance over other algorithms with which we compare.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/20/2019

Best Arm Identification in Generalized Linear Bandits

Motivated by drug design, we consider the best-arm identification proble...
research
06/29/2020

Optimal Best-arm Identification in Linear Bandits

We study the problem of best-arm identification with fixed confidence in...
research
06/13/2020

Explicit Best Arm Identification in Linear Bandits Using No-Regret Learners

We study the problem of best arm identification in linearly parameterise...
research
03/18/2021

Top-m identification for linear bandits

Motivated by an application to drug repurposing, we propose the first al...
research
05/24/2022

Optimality Conditions and Algorithms for Top-K Arm Identification

We consider the top-k arm identification problem for multi-armed bandits...
research
05/20/2019

Gradient Ascent for Active Exploration in Bandit Problems

We present a new algorithm based on an gradient ascent for a general Act...
research
03/02/2023

Open Problem: Optimal Best Arm Identification with Fixed Budget

Best arm identification or pure exploration problems have received much ...

Please sign up or login with your details

Forgot password? Click here to reset