Preselection Bandits under the Plackett-Luce Model

07/13/2019
by Viktor Bengs, et al.

In this paper, we introduce the Preselection Bandit problem, in which the learner preselects a subset of arms (choice alternatives) for a user, who then chooses the final arm from this subset. The learner does not know the user's preferences but can learn them from observed choices. In our concrete setting, we allow these choices to be stochastic and model the user's actions by means of the Plackett-Luce model. The learner's main task is to preselect subsets that eventually lead to highly preferred choices. To formalize this goal, we introduce a reasonable notion of regret and derive lower bounds on the expected regret. Moreover, we propose algorithms whose upper bounds on the expected regret match the lower bound up to a logarithmic term in the time horizon.
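To make the user model concrete, here is a minimal sketch of a stochastic choice from a preselected subset under the standard Plackett-Luce choice rule, where an arm is picked with probability proportional to its positive score. The arm names, the score values, and the helper names `pl_choice_probs` and `sample_choice` are illustrative assumptions; they are not taken from the paper, and the paper's regret definition and algorithms are not reproduced here.

```python
import random

def pl_choice_probs(scores, subset):
    """Choice probabilities under the Plackett-Luce rule.

    scores: dict mapping arm -> positive score (hypothetical values).
    subset: the arms preselected by the learner.
    Each arm in the subset is chosen with probability
    score(arm) / sum of scores over the subset.
    """
    total = sum(scores[arm] for arm in subset)
    return {arm: scores[arm] / total for arm in subset}

def sample_choice(scores, subset, rng):
    """Simulate one user choice from the preselected subset."""
    arms = list(subset)
    weights = [scores[arm] for arm in arms]
    return rng.choices(arms, weights=weights, k=1)[0]

# Hypothetical latent scores, unknown to the learner in the bandit setting.
scores = {"a": 4.0, "b": 2.0, "c": 1.0, "d": 1.0}
subset = ["a", "b", "c"]

probs = pl_choice_probs(scores, subset)
rng = random.Random(0)
choice = sample_choice(scores, subset, rng)
```

In this sketch the learner would observe only `choice`, never `scores`, and would have to infer which subsets to offer from repeated observations of this kind.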

research
06/13/2023

Tight Memory-Regret Lower Bounds for Streaming Bandits

In this paper, we investigate the streaming bandits problem, wherein the...
research
09/30/2015

Regret Lower Bound and Optimal Algorithm in Finite Stochastic Partial Monitoring

Partial monitoring is a general model for sequential learning with limit...
research
03/01/2019

Regret Minimisation in Multinomial Logit Bandits

We consider two regret minimisation problems over subsets of a finite gr...
research
06/15/2023

Logarithmic Bayes Regret Bounds

We derive the first finite-time logarithmic regret bounds for Bayesian b...
research
07/26/2019

Lexicographic Multiarmed Bandit

We consider a multiobjective multiarmed bandit problem with lexicographi...
research
11/26/2015

Gains and Losses are Fundamentally Different in Regret Minimization: The Sparse Case

We demonstrate that, in the classical non-stochastic regret minimization...
research
12/06/2016

Coactive Critiquing: Elicitation of Preferences and Features

When faced with complex choices, users refine their own preference crite...
