Non-Stochastic Multi-Player Multi-Armed Bandits: Optimal Rate With Collision Information, Sublinear Without

04/28/2019
by   Sébastien Bubeck, et al.
4

We consider the non-stochastic version of the (cooperative) multi-player multi-armed bandit problem. The model assumes no communication at all between the players, and furthermore when two (or more) players select the same action this results in a maximal loss. We prove the first √(T)-type regret guarantee for this problem, under the feedback model where collisions are announced to the colliding players. Such a bound was not known even for the simpler stochastic version. We also prove the first sublinear guarantee for the feedback model where collision information is not available, namely T^1-1/2m where m is the number of players.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/21/2018

SIC-MMAB: Synchronisation Involves Communication in Multiplayer Multi-Armed Bandits

We consider the stochastic multiplayer multi-armed bandit problem, where...
research
02/10/2021

Player Modeling via Multi-Armed Bandits

This paper focuses on building personalized player models solely from pl...
research
11/08/2020

Cooperative and Stochastic Multi-Player Multi-Armed Bandit: Optimal Regret With Neither Communication Nor Collisions

We consider the cooperative multi-player version of the stochastic multi...
research
02/29/2020

Decentralized Multi-player Multi-armed Bandits with No Collision Information

The decentralized stochastic multi-player multi-armed bandit (MP-MAB) pr...
research
08/25/2018

Multiplayer bandits without observing collision information

We study multiplayer stochastic multi-armed bandit problems in which the...
research
11/02/2020

On No-Sensing Adversarial Multi-player Multi-armed Bandits with Collision Communications

We study the notoriously difficult no-sensing adversarial multi-player m...
research
07/26/2023

Active Robot Vision for Distant Object Change Detection: A Lightweight Training Simulator Inspired by Multi-Armed Bandits

In ground-view object change detection, the recently emerging map-less n...

Please sign up or login with your details

Forgot password? Click here to reset