The Role of Contextual Information in Best Arm Identification

06/26/2021
by   Masahiro Kato, et al.
0

We study the best-arm identification problem with fixed confidence when contextual (covariate) information is available in stochastic bandits. Although we can use contextual information in each round, we are interested in the marginalized mean reward over the contextual distribution. Our goal is to identify the best arm with a minimal number of samplings under a given value of the error rate. We show the instance-specific sample complexity lower bounds for the problem. Then, we propose a context-aware version of the "Track-and-Stop" strategy, wherein the proportion of the arm draws tracks the set of optimal allocations and prove that the expected number of arm draws matches the lower bound asymptotically. We demonstrate that contextual information can be used to improve the efficiency of the identification of the best marginalized mean reward compared with the results of Garivier Kaufmann (2016). We experimentally confirm that context information contributes to faster best-arm identification.

READ FULL TEXT
research
09/15/2022

Semiparametric Best Arm Identification with Contextual Information

We study best-arm identification with a fixed budget and contextual (cov...
research
11/19/2018

Best-arm identification with cascading bandits

We consider a variant of the problem of best arm identification in multi...
research
12/02/2019

Optimal Best Markovian Arm Identification with Fixed Confidence

We give a complete characterization of the sampling complexity of best M...
research
05/29/2017

Improving the Expected Improvement Algorithm

The expected improvement (EI) algorithm is a popular strategy for inform...
research
11/16/2017

Bayesian Best-Arm Identification for Selecting Influenza Mitigation Strategies

Pandemic influenza has the epidemic potential to kill millions of people...
research
01/06/2022

Learning Optimal Antenna Tilt Control Policies: A Contextual Linear Bandit Approach

Controlling antenna tilts in cellular networks is imperative to reach an...
research
04/14/2022

Measurement-based Admission Control in Sliced Networks: A Best Arm Identification Approach

In sliced networks, the shared tenancy of slices requires adaptive admis...

Please sign up or login with your details

Forgot password? Click here to reset