We investigate online classification with paid stochastic experts. Here,...
The classical algorithms used in tabular reinforcement learning (Value
I...
Black box optimisation of an unknown function from expensive and noisy
e...
The stochastic generalised linear bandit is a well-understood model for
...
The fidelity bandits problem is a variant of the K-armed bandit problem ...
There are many provably efficient algorithms for episodic reinforcement
...
Reinforcement learning algorithms are widely used in domains where it is...
The principle of optimism in the face of uncertainty underpins many
theo...
We study the recovering bandits problem, a variant of the stochastic
mul...
We study the bandits with delayed anonymous feedback problem, a variant ...