Distributed Online Learning for Joint Regret with Communication Constraints

02/15/2021
by Dirk van der Hoeven, et al.

In this paper we consider a distributed online learning setting for joint regret with communication constraints. In this multi-agent setting, an adversary activates one agent in each round t, and that agent has to issue a prediction. A subset of all the agents may then communicate a b-bit message to their neighbors in a graph. All agents cooperate to control the joint regret, which is the sum of the agents' losses minus the sum of the losses evaluated at the best fixed common comparator parameters u. We provide a comparator-adaptive algorithm for this setting, meaning that the joint regret scales with the norm of the comparator u. To address the communication constraints we provide deterministic and stochastic gradient compression schemes, and we show that with these schemes our algorithm has worst-case optimal regret when all agents communicate in every round. Additionally, we exploit the comparator-adaptive property of our algorithm to learn the best partition from a set of candidate partitions, which allows different subsets of agents to learn different comparators.


research
05/12/2022

Collaborative Multi-agent Stochastic Linear Bandits

We study a collaborative multi-agent stochastic linear bandit setting, w...
research
12/03/2020

Distributed Thompson Sampling

We study a cooperative multi-agent multi-armed bandits with M agents and...
research
11/28/2019

Adaptive Communication Bounds for Distributed Online Learning

We consider distributed online learning protocols that control the excha...
research
10/04/2019

Social Learning in Multi Agent Multi Armed Bandits

In this paper, we introduce a distributed version of the classical stoch...
research
12/21/2020

Multi-Agent Online Optimization with Delays: Asynchronicity, Adaptivity, and Optimism

Online learning has been successfully applied to many problems in which ...
research
06/07/2023

Optimal Fair Multi-Agent Bandits

In this paper, we study the problem of fair multi-agent multi-arm bandit...
research
03/17/2019

DSPG: Decentralized Simultaneous Perturbations Gradient Descent Scheme

In this paper, we present an asynchronous approximate gradient method th...
