Safe Linear Stochastic Bandits

11/21/2019
by   Kia Khezeli, et al.
0

We introduce the safe linear stochastic bandit framework—a generalization of linear stochastic bandits—where, in each stage, the learner is required to select an arm with an expected reward that is no less than a predetermined (safe) threshold with high probability. We assume that the learner initially has knowledge of an arm that is known to be safe, but not necessarily optimal. Leveraging on this assumption, we introduce a learning algorithm that systematically combines known safe arms with exploratory arms to safely expand the set of safe arms over time, while facilitating safe greedy exploitation in subsequent stages. In addition to ensuring the satisfaction of the safety constraint at every stage of play, the proposed algorithm is shown to exhibit an expected regret that is no more than O(√(T)log (T)) after T stages of play.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/16/2019

Linear Stochastic Bandits Under Safety Constraints

Bandit algorithms have various application in safety-critical systems, w...
research
03/03/2021

Combinatorial Bandits without Total Order for Arms

We consider the combinatorial bandits problem, where at each time step, ...
research
07/26/2019

Lexicographic Multiarmed Bandit

We consider a multiobjective multiarmed bandit problem with lexicographi...
research
01/23/2019

Online Learning with Diverse User Preferences

In this paper, we investigate the impact of diverse user preference on l...
research
02/09/2022

Finding Optimal Arms in Non-stochastic Combinatorial Bandits with Semi-bandit Feedback and Finite Budget

We consider the combinatorial bandits problem with semi-bandit feedback ...
research
09/15/2023

Price of Safety in Linear Best Arm Identification

We introduce the safe best-arm identification framework with linear feed...
research
12/13/2021

Safe Linear Leveling Bandits

Multi-armed bandits (MAB) are extensively studied in various settings wh...

Please sign up or login with your details

Forgot password? Click here to reset