-
An optimal algorithm for the Thresholding Bandit Problem
We study a specific combinatorial pure exploration stochastic bandit pro...
read it
-
Multitask Bandit Learning through Heterogeneous Feedback Aggregation
In many real-world applications, multiple agents seek to learn how to pe...
read it
-
Polynomial-time Algorithms for Combinatorial Pure Exploration with Full-bandit Feedback
We study the problem of stochastic combinatorial pure exploration (CPE),...
read it
-
Maximin Action Identification: A New Bandit Framework for Games
We study an original problem of pure exploration in a strategic bandit m...
read it
-
Combinatorial Pure Exploration with Full-bandit Feedback and Beyond: Solving Combinatorial Optimization under Uncertainty with Limited Observation
Combinatorial optimization is one of the fundamental research fields tha...
read it
-
Non-Asymptotic Pure Exploration by Solving Games
Pure exploration (aka active testing) is the fundamental task of sequent...
read it
-
Volumetric Spanners: an Efficient Exploration Basis for Learning
Numerous machine learning problems require an exploration basis - a mech...
read it
Instance-Sensitive Algorithms for Pure Exploration in Multinomial Logit Bandit
Motivated by real-world applications such as fast fashion retailing and online advertising, the Multinomial Logit Bandit (MNL-bandit) is a popular model in online learning and operations research, and has attracted much attention in the past decade. However, it is a bit surprising that pure exploration, a basic problem in bandit theory, has not been well studied in MNL-bandit so far. In this paper we give efficient algorithms for pure exploration in MNL-bandit. Our algorithms achieve instance-sensitive pull complexities. We also complement the upper bounds by an almost matching lower bound.
READ FULL TEXT
Comments
There are no comments yet.