Internet of Things (IoT) technologies have enabled numerous data-driven
...
Online learning to rank (OLTR) is a sequential decision-making problem w...
We introduce and study the online pause and resume problem. In this prob...
We study contextual combinatorial bandits with probabilistically trigger...
The recent advances of conversational recommendations provide a promisin...
In this paper, we study the combinatorial semi-bandits (CMAB) and focus ...
Multi-layered network exploration (MuLaNE) problem is an important probl...
We study the sequential resource allocation problem where a decision mak...
Online influence maximization has attracted much attention as a way to
m...
We consider the stochastic multi-armed bandit (MAB) problem in a setting...