Doubly Adversarial Federated Bandits

01/22/2023
by   Jialin Yi, et al.
0

We study a new non-stochastic federated multi-armed bandit problem with multiple agents collaborating via a communication network. The losses of the arms are assigned by an oblivious adversary that specifies the loss of each arm not only for each time step but also for each agent, which we call “doubly adversarial". In this setting, different agents may choose the same arm in the same time step but observe different feedback. The goal of each agent is to find a globally best arm in hindsight that has the lowest cumulative loss averaged over all agents, which necessities the communication among agents. We provide regret lower bounds for any federated bandit algorithm under different settings, when agents have access to full-information feedback, or the bandit feedback. For the bandit feedback setting, we propose a near-optimal federated bandit algorithm called FEDEXP3. Our algorithm gives a positive answer to an open question proposed in Cesa-Bianchi et al. (2016): FEDEXP3 can guarantee a sub-linear regret without exchanging sequences of selected arm identities or loss sequences among agents. We also provide numerical evaluations of our algorithm to validate our theoretical results and demonstrate its effectiveness on synthetic and real-world datasets

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/31/2022

Near-Optimal Collaborative Learning in Bandits

This paper introduces a general multi-agent bandit model in which each a...
research
10/20/2020

Bayesian Algorithms for Decentralized Stochastic Bandits

We study a decentralized cooperative multi-agent multi-armed bandit prob...
research
06/09/2021

Cooperative Online Learning

In this preliminary (and unpolished) version of the paper, we study an a...
research
09/14/2018

Dueling Bandits with Qualitative Feedback

We formulate and study a novel multi-armed bandit problem called the qua...
research
07/27/2023

Adversarial Sleeping Bandit Problems with Multiple Plays: Algorithm and Ranking Application

This paper presents an efficient algorithm to solve the sleeping bandit ...
research
10/29/2021

A/B/n Testing with Control in the Presence of Subpopulations

Motivated by A/B/n testing applications, we consider a finite set of dis...
research
03/29/2023

Federated Stochastic Bandit Learning with Unobserved Context

We study the problem of federated stochastic multi-arm contextual bandit...

Please sign up or login with your details

Forgot password? Click here to reset