Best Arm Identification for Stochastic Rising Bandits

02/15/2023
by   Marco Mussi, et al.
0

Stochastic Rising Bandits is a setting in which the values of the expected rewards of the available options increase every time they are selected. This framework models a wide range of scenarios in which the available options are learning entities whose performance improves over time. In this paper, we focus on the Best Arm Identification (BAI) problem for the stochastic rested rising bandits. In this scenario, we are asked, given a fixed budget of rounds, to provide a recommendation about the best option at the end of the selection process. We propose two algorithms to tackle the above-mentioned setting, namely R-UCBE, which resorts to a UCB-like approach, and R-SR, which employs a successive reject procedure. We show that they provide guarantees on the probability of properly identifying the optimal option at the end of the learning process. Finally, we numerically validate the proposed algorithms in synthetic and realistic environments and compare them with the currently available BAI strategies.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/09/2022

Choosing Answers in ε-Best-Answer Identification for Linear Bandits

In pure-exploration problems, information is gathered sequentially to an...
research
11/14/2021

Mean-based Best Arm Identification in Stochastic Bandits under Reward Contamination

This paper investigates the problem of best arm identification in contam...
research
10/15/2020

Probabilistic Sequential Shrinking: A Best Arm Identification Algorithm for Stochastic Bandits with Corruptions

We consider a best arm identification (BAI) problem for stochastic bandi...
research
07/22/2022

SPRT-based Efficient Best Arm Identification in Stochastic Bandits

This paper investigates the best arm identification (BAI) problem in sto...
research
07/27/2023

A/B Testing and Best-arm Identification for Linear Bandits with Robustness to Non-stationarity

We investigate the fixed-budget best-arm identification (BAI) problem fo...
research
02/22/2018

Options for encoding names for data linking at the Australian Bureau of Statistics

Publicly, ABS has said it would use a cryptographic hash function to con...
research
05/17/2023

Sequential Best-Arm Identification with Application to Brain-Computer Interface

A brain-computer interface (BCI) is a technology that enables direct com...

Please sign up or login with your details

Forgot password? Click here to reset