Best Arm Identification under Additive Transfer Bandits

12/08/2021
by   Ojash Neopane, et al.
0

We consider a variant of the best arm identification (BAI) problem in multi-armed bandits (MAB) in which there are two sets of arms (source and target), and the objective is to determine the best target arm while only pulling source arms. In this paper, we study the setting when, despite the means being unknown, there is a known additive relationship between the source and target MAB instances. We show how our framework covers a range of previously studied pure exploration problems and additionally captures new problems. We propose and theoretically analyze an LUCB-style algorithm to identify an ϵ-optimal target arm with high probability. Our theoretical analysis highlights aspects of this transfer learning problem that do not arise in the typical BAI setup, and yet recover the LUCB algorithm for single domain BAI as a special case.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/10/2021

Best-Arm Identification in Correlated Multi-Armed Bandits

In this paper we consider the problem of best-arm identification in mult...
research
05/20/2019

Best Arm Identification in Generalized Linear Bandits

Motivated by drug design, we consider the best-arm identification proble...
research
06/16/2020

Finding All ε-Good Arms in Stochastic Bandits

The pure-exploration problem in stochastic multi-armed bandits aims to f...
research
08/02/2021

Pure Exploration in Multi-armed Bandits with Graph Side Information

We study pure exploration in multi-armed bandits with graph side-informa...
research
03/29/2018

Best arm identification in multi-armed bandits with delayed feedback

We propose a generalization of the best arm identification problem in st...
research
03/05/2020

Robustness Guarantees for Mode Estimation with an Application to Bandits

Mode estimation is a classical problem in statistics with a wide range o...
research
01/10/2023

Best Arm Identification in Stochastic Bandits: Beyond β-optimality

This paper focuses on best arm identification (BAI) in stochastic multi-...

Please sign up or login with your details

Forgot password? Click here to reset