Cooperative Online Learning: Keeping your Neighbors Updated

01/23/2019
by   Nicolò Cesa-Bianchi, et al.
18

We study an asynchronous online learning setting with a network of agents. At each time step, some of the agents are activated, requested to make a prediction, and pay the corresponding loss. The loss function is then revealed to these agents and also to their neighbors in the network. When activations are stochastic, we show that the regret achieved by N agents running the standard online Mirror Descent is O(√(α T)), where T is the horizon and α< N is the independence number of the network. This is in contrast to the regret Ω(√(N T)) which N agents incur in the same setting when feedback is not shared. We also show a matching lower bound of order √(α T) that holds for any given network. When the pattern of agent activations is arbitrary, the problem changes significantly: we prove a Ω(T) lower bound on the regret that holds for any online algorithm oblivious to the feedback source.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/09/2021

Cooperative Online Learning

In this preliminary (and unpolished) version of the paper, we study an a...
research
01/29/2019

Decentralized Online Learning: Take Benefits from Others' Data without Sharing Your Own to Track Global Trend

Decentralized Online Learning (online learning in decentralized networks...
research
03/14/2016

Online Isotonic Regression

We consider the online version of the isotonic regression problem. Given...
research
08/30/2013

Online Ranking: Discrete Choice, Spearman Correlation and Other Feedback

Given a set V of n objects, an online ranking system outputs at each tim...
research
11/10/2019

Grinding the Space: Learning to Classify Against Strategic Agents

We study the problem of online learning in strategic classification sett...
research
10/01/2013

Online Learning of Dynamic Parameters in Social Networks

This paper addresses the problem of online learning in a dynamic setting...
research
10/05/2020

An Efficient Algorithm for Cooperative Semi-Bandits

We consider the problem of asynchronous online combinatorial optimizatio...

Please sign up or login with your details

Forgot password? Click here to reset