Towards Minimax Optimal Best Arm Identification in Linear Bandits

05/27/2021
by   Junwen Yang, et al.
0

We study the problem of best arm identification in linear bandits in the fixed-budget setting. By leveraging properties of the G-optimal design and incorporating it into the arm allocation rule, we design a parameter-free algorithm, Optimal Design-based Linear Best Arm Identification (OD-LinBAI). We provide a theoretical analysis of the failure probability of OD-LinBAI. While the performances of existing methods (e.g., BayesGap) depend on all the optimality gaps, OD-LinBAI depends on the gaps of the top d arms, where d is the effective dimension of the linear bandit instance. Furthermore, we present a minimax lower bound for this problem. The upper and lower bounds show that OD-LinBAI is minimax optimal up to multiplicative factors in the exponent. Finally, numerical experiments corroborate our theoretical findings.

READ FULL TEXT
research
06/29/2020

Optimal Best-arm Identification in Linear Bandits

We study the problem of best-arm identification with fixed confidence in...
research
10/15/2020

Probabilistic Sequential Shrinking: A Best Arm Identification Algorithm for Stochastic Bandits with Corruptions

We consider a best arm identification (BAI) problem for stochastic bandi...
research
05/20/2019

Best Arm Identification in Generalized Linear Bandits

Motivated by drug design, we consider the best-arm identification proble...
research
07/27/2023

A/B Testing and Best-arm Identification for Linear Bandits with Robustness to Non-stationarity

We investigate the fixed-budget best-arm identification (BAI) problem fo...
research
04/14/2022

Measurement-based Admission Control in Sliced Networks: A Best Arm Identification Approach

In sliced networks, the shared tenancy of slices requires adaptive admis...
research
12/22/2020

Refined bounds for randomized experimental design

Experimental design is an approach for selecting samples among a given s...
research
09/16/2021

Policy Choice and Best Arm Identification: Comments on "Adaptive Treatment Assignment in Experiments for Policy Choice"

Adaptive experimental design for efficient decision-making is an importa...

Please sign up or login with your details

Forgot password? Click here to reset