Instance-Sensitive Algorithms for Pure Exploration in Multinomial Logit Bandit

12/02/2020
by   Nikolai Karpov, et al.
0

Motivated by real-world applications such as fast fashion retailing and online advertising, the Multinomial Logit Bandit (MNL-bandit) is a popular model in online learning and operations research, and has attracted much attention in the past decade. However, it is a bit surprising that pure exploration, a basic problem in bandit theory, has not been well studied in MNL-bandit so far. In this paper we give efficient algorithms for pure exploration in MNL-bandit. Our algorithms achieve instance-sensitive pull complexities. We also complement the upper bounds by an almost matching lower bound.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/08/2021

Pure Exploration Bandit Problem with General Reward Functions Depending on Full Distributions

In this paper, we study the pure exploration bandit model on general dis...
research
10/29/2021

Collaborative Pure Exploration in Kernel Bandit

In this paper, we formulate a Collaborative Pure Exploration in Kernel B...
research
05/27/2016

An optimal algorithm for the Thresholding Bandit Problem

We study a specific combinatorial pure exploration stochastic bandit pro...
research
06/09/2021

Cooperative Online Learning

In this preliminary (and unpolished) version of the paper, we study an a...
research
06/18/2022

Thompson Sampling for (Combinatorial) Pure Exploration

Existing methods of combinatorial pure exploration mainly focus on the U...
research
06/06/2023

Pivotuner: automatic real-time pure intonation and microtonal modulation

Pivotuner is a VST3/AU MIDI effect plugin that automatically tunes note ...
research
11/13/2018

Community Exploration: From Offline Optimization to Online Learning

We introduce the community exploration problem that has many real-world ...

Please sign up or login with your details

Forgot password? Click here to reset