The Influence of Shape Constraints on the Thresholding Bandit Problem

06/17/2020
by James Cheshire, et al.

We investigate the stochastic Thresholding Bandit Problem (TBP) under several shape constraints. In addition to (i) the vanilla, unstructured TBP, we consider (ii) the case where the sequence of arms' means (μ_k)_k is monotonically increasing (MTBP), (iii) the case where (μ_k)_k is unimodal (UTBP), and (iv) the case where (μ_k)_k is concave (CTBP). In the TBP, the aim is to output, at the end of the sequential game, the set of arms whose means are above a given threshold. The regret is the largest gap between the mean of a misclassified arm and the threshold. In the fixed-budget setting, we provide problem-independent minimax rates for the expected regret in all four settings, as well as associated algorithms. We prove that the minimax rates for the regret are (i) √(log(K)K/T) for TBP, (ii) √(log(K)/T) for MTBP, (iii) √(K/T) for UTBP and (iv) √(loglog K/T) for CTBP, where K is the number of arms and T is the budget. These rates show that the dependence of the minimax regret on K varies significantly with the shape constraint, which highlights that shape constraints fundamentally change the nature of the TBP.
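To make the regret definition and the four rates concrete, here is a minimal Python sketch. It is illustrative only and is not the authors' algorithm: the function names `thresholding_regret` and `minimax_rates` are hypothetical, the tie convention (a mean equal to the threshold counts as "above") is an assumption that does not affect the regret, and the rates are computed with all constants dropped and natural logarithms assumed.

```python
import math


def thresholding_regret(means, threshold, predicted_above):
    """Largest gap |mu_k - threshold| over misclassified arms (0.0 if none).

    `predicted_above` is the set of arm indices the learner outputs as
    having a mean above the threshold. Hypothetical helper for illustration.
    """
    predicted = set(predicted_above)
    worst_gap = 0.0
    for k, mu in enumerate(means):
        truly_above = mu >= threshold  # assumed tie convention; gap is 0 on ties
        if truly_above != (k in predicted):
            worst_gap = max(worst_gap, abs(mu - threshold))
    return worst_gap


def minimax_rates(K, T):
    """Rates quoted in the abstract, up to constants (natural log assumed)."""
    if K < 3 or T <= 0:
        # log(log(K)) is only positive for K >= 3, so smaller K is rejected.
        raise ValueError("this sketch needs K >= 3 and T > 0")
    log_k = math.log(K)
    return {
        "TBP": math.sqrt(K * log_k / T),         # unstructured
        "MTBP": math.sqrt(log_k / T),            # monotone means
        "UTBP": math.sqrt(K / T),                # unimodal means
        "CTBP": math.sqrt(math.log(log_k) / T),  # concave means
    }


if __name__ == "__main__":
    means = [0.1, 0.4, 0.6, 0.9]
    # Arm 1 (mean 0.4) is wrongly reported as above the threshold 0.5.
    print(thresholding_regret(means, 0.5, {1, 2, 3}))
    for name, rate in minimax_rates(K=16, T=1000).items():
        print(name, round(rate, 4))
```

With K = 16 and any fixed budget, the four values come out ordered CTBP < MTBP < UTBP < TBP, which is the variation in the dependence on K that the abstract points to.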


