Adaptivity to the difficulties of a problem is a key property in sequent...
In this paper we propose a general methodology to derive regret bounds f...
In the stochastic multi-armed bandit problem, a randomized probability
m...
This paper considers the partial monitoring problem with k-actions and
d...
This paper considers the multi-armed bandit (MAB) problem and provides a...
We consider the fixed-budget best arm identification problem where the g...
We study the survival bandit problem, a variant of the multi-armed bandi...
This study considers online learning with general directed feedback grap...
We consider nonstationary multi-armed bandit problems where the model
pa...
Ordinary supervised learning is useful when we have paired training data...
Combinatorial optimization is one of the fundamental research fields tha...
Dense subgraph discovery aims to find a dense component in edge-weighted...
We investigate finite stochastic partial monitoring, which is a general ...
The Gaussian process bandit is a problem in which we want to find a maxi...
Many scientific experiments have an interest in the estimation of the av...
Uncoupled regression is the problem to learn a model from unlabeled data...
A classic setting of the stochastic K-armed bandit problem is considered...
We study the problem of stochastic combinatorial pure exploration (CPE),...
We study a bad arm existing checking problem in which a player's task is...
We investigate the problem of multiclass classification with rejection, ...
We formulate and study a novel multi-armed bandit problem called the
qua...
Unsupervised domain adaptation is the problem setting where data generat...
In this paper, we consider and discuss a new stochastic multi-armed band...
We propose the first fully-adaptive algorithm for pure exploration in li...
We study the K-armed dueling bandit problem, a variation of the standard...
Partial monitoring is a general model for sequential learning with limit...
We study the K-armed dueling bandit problem, a variation of the standard...
We discuss a multiple-play multi-armed bandit (MAB) problem in which sev...
Consider the problem of sampling sequentially from a finite number of N ...