Multi-Agent Online Optimization with Delays: Asynchronicity, Adaptivity, and Optimism

12/21/2020
by   Yu-Guan Hsieh, et al.
0

Online learning has been successfully applied to many problems in which data are revealed over time. In this paper, we provide a general framework for studying multi-agent online learning problems in the presence of delays and asynchronicities. Specifically, we propose and analyze a class of adaptive dual averaging schemes in which agents only need to accumulate gradient feedback received from the whole system, without requiring any between-agent coordination. In the single-agent case, the adaptivity of the proposed method allows us to extend a range of existing results to problems with potentially unbounded delays between playing an action and receiving the corresponding feedback. In the multi-agent case, the situation is significantly more complicated because agents may not have access to a global clock to use as a reference point; to overcome this, we focus on the information that is available for producing each prediction rather than the actual delay associated with each feedback. This allows us to derive adaptive learning strategies with optimal regret bounds, at both the agent and network levels. Finally, we also analyze an "optimistic" variant of the proposed algorithm which is capable of exploiting the predictability of problems with a slower variation and leads to improved regret bounds.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/04/2013

Online Learning under Delayed Feedback

Online learning with delayed feedback has received increasing attention ...
research
08/14/2020

Kernel Methods for Cooperative Multi-Agent Contextual Bandits

Cooperative multi-agent decision making involves a group of agents coope...
research
06/09/2021

Cooperative Online Learning

In this preliminary (and unpolished) version of the paper, we study an a...
research
02/15/2021

Distributed Online Learning for Joint Regret with Communication Constraints

In this paper we consider a distributed online learning setting for jo...
research
12/02/2022

Multi-Agent Reinforcement Learning with Reward Delays

This paper considers multi-agent reinforcement learning (MARL) where the...
research
05/04/2021

The distributed dual ascent algorithm is robust to asynchrony

The distributed dual ascent is an established algorithm to solve strongl...
research
05/24/2022

Multi-Head Online Learning for Delayed Feedback Modeling

In online advertising, it is highly important to predict the probability...

Please sign up or login with your details

Forgot password? Click here to reset