Controlling the parameters' norm often yields good generalisation when
t...
Due to its empirical success on few shot classification and reinforcemen...
Due mostly to its application to cognitive radio networks, multiplayer
b...
The training of neural networks by gradient descent methods is a corners...
Multi-task learning leverages structural similarities between multiple t...
Motivated by packet routing in computer networks, online queuing systems...
We study online learning for optimal allocation when the resource to be
...
Potential buyers of a product or service tend to first browse feedback f...
We investigate stochastic combinatorial multi-armed bandit with semi-ban...
Motivated by cognitive radios, stochastic multi-player multi-armed bandi...
Private data are valuable either by remaining private (for instance if t...
We consider the stochastic multiplayer multi-armed bandit problem, where...
Plato provides sound and tight deterministic error guarantees for approx...