Censored Semi-Bandits for Resource Allocation

04/12/2021
by   Arun Verma, et al.
0

We consider the problem of sequentially allocating resources in a censored semi-bandits setup, where the learner allocates resources at each step to the arms and observes loss. The loss depends on two hidden parameters, one specific to the arm but independent of the resource allocation, and the other depends on the allocated resource. More specifically, the loss equals zero for an arm if the resource allocated to it exceeds a constant (but unknown) arm dependent threshold. The goal is to learn a resource allocation that minimizes the expected loss. The problem is challenging because the loss distribution and threshold value of each arm are unknown. We study this setting by establishing its `equivalence' to Multiple-Play Multi-Armed Bandits (MP-MAB) and Combinatorial Semi-Bandits. Exploiting these equivalences, we derive optimal algorithms for our problem setting using known algorithms for MP-MAB and Combinatorial Semi-Bandits. The experiments on synthetically generated data validate the performance guarantees of the proposed algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/04/2019

Censored Semi-Bandits: A Framework for Resource Allocation with Censored Feedback

In this paper, we study Censored Semi-Bandits, a novel variant of the se...
research
06/17/2020

Stochastic Network Utility Maximization with Unknown Utilities: Multi-Armed Bandits Approach

In this paper, we study a novel Stochastic Network Utility Maximization ...
research
05/10/2021

Combinatorial Multi-armed Bandits for Resource Allocation

We study the sequential resource allocation problem where a decision mak...
research
09/16/2020

Thompson Sampling for Unsupervised Sequential Selection

Thompson Sampling has generated significant interest due to its better e...
research
06/06/2018

Finding the Bandit in a Graph: Sequential Search-and-Stop

We consider the problem where an agent wants to find a hidden object tha...
research
03/28/2018

A Better Resource Allocation Algorithm with Semi-Bandit Feedback

We study a sequential resource allocation problem between a fixed number...
research
09/05/2019

An Arm-wise Randomization Approach to Combinatorial Linear Semi-bandits

Combinatorial linear semi-bandits (CLS) are widely applicable frameworks...

Please sign up or login with your details

Forgot password? Click here to reset