Graph neural networks are useful for learning problems, as well as for
c...
We study K-armed bandit problems where the reward distributions of the a...
Simple regret is a natural and parameter-free performance criterion for
...
In sparse linear bandits, a learning agent sequentially selects an actio...
Linear bandits have a wide variety of applications including recommendat...
In this work we consider the problem of regret minimization for logistic...
Logistic Bandits have recently undergone careful scrutiny by virtue of t...
The PhD thesis of Maillard (2013) presents a randomized algorithm for th...
In online learning problems, exploiting low variance plays an important ...
Structured stochastic multi-armed bandits provide accelerated regret rat...
We propose improved fixed-design confidence bounds for the linear logist...
We study stochastic structured bandits for minimizing regret. The fact t...
In this paper, we consider the nonparametric least square regression in ...
We consider the problem of unconstrained online convex optimization (OCO...
We introduce the bilinear bandit problem with low-rank structure where a...
We study adversarial attacks that manipulate the reward signals to contr...
We study offline data poisoning attacks in contextual bandits, a class o...
A key challenge in online learning is that classical algorithms can be s...
Generalized Linear Bandits (GLBs), a natural extension of the stochastic...
This paper describes a new parameter-free online learning algorithm for
...
In graph-based active learning, algorithms based on expected error
minim...