Near-Optimal No-Regret Learning in General Games

08/16/2021
by Constantinos Daskalakis, et al.
We show that Optimistic Hedge, a common variant of multiplicative-weights-updates with recency bias, attains poly(log T) regret in multi-player general-sum games. In particular, when every player of the game uses Optimistic Hedge to iteratively update her strategy in response to the history of play so far, then after T rounds of interaction, each player experiences total regret that is poly(log T). Our bound improves exponentially on the O(T^{1/2}) regret attainable by standard no-regret learners in games, the O(T^{1/4}) regret attainable by no-regret learners with recency bias (Syrgkanis et al., 2015), and the O(T^{1/6}) bound that was recently shown for Optimistic Hedge in the special case of two-player games (Chen & Peng, 2020). A corollary of our bound is that Optimistic Hedge converges to coarse correlated equilibrium in general games at a rate of Õ(1/T).
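To make the update rule concrete, here is a minimal sketch of Optimistic Hedge in two-player self-play. It assumes the standard formulation in which each player's strategy is proportional to exp(-eta * (cumulative loss + most recent loss)), so the latest loss vector is counted twice (the recency bias). The function names, the fixed step size `eta`, and the use of loss matrices with entries in [0, 1] are illustrative assumptions, not details taken from the abstract.

```python
import math


def optimistic_hedge_step(cum_loss, last_loss, eta):
    """Distribution proportional to exp(-eta * (cum_loss + last_loss))."""
    scores = [-eta * (c + l) for c, l in zip(cum_loss, last_loss)]
    top = max(scores)  # subtract the max for numerical stability
    weights = [math.exp(s - top) for s in scores]
    total = sum(weights)
    return [w / total for w in weights]


def self_play(loss_row, loss_col, n_rounds, eta):
    """Both players run Optimistic Hedge; returns final strategies and regrets.

    loss_row[i][j] / loss_col[i][j] are the losses of the row / column player
    when the row player picks action i and the column player picks action j.
    Assumes n_rounds >= 1. The step size eta is a free parameter here.
    """
    n, m = len(loss_row), len(loss_row[0])
    cum_x, last_x = [0.0] * n, [0.0] * n
    cum_y, last_y = [0.0] * m, [0.0] * m
    incurred_x = incurred_y = 0.0
    for _ in range(n_rounds):
        x = optimistic_hedge_step(cum_x, last_x, eta)
        y = optimistic_hedge_step(cum_y, last_y, eta)
        # Expected loss of each pure action against the opponent's mixed strategy.
        lx = [sum(loss_row[i][j] * y[j] for j in range(m)) for i in range(n)]
        ly = [sum(loss_col[i][j] * x[i] for i in range(n)) for j in range(m)]
        incurred_x += sum(p * l for p, l in zip(x, lx))
        incurred_y += sum(p * l for p, l in zip(y, ly))
        cum_x = [c + l for c, l in zip(cum_x, lx)]
        cum_y = [c + l for c, l in zip(cum_y, ly)]
        last_x, last_y = lx, ly
    # External regret: incurred loss minus the loss of the best fixed action.
    regret_x = incurred_x - min(cum_x)
    regret_y = incurred_y - min(cum_y)
    return x, y, regret_x, regret_y
```

Dividing each returned regret by the number of rounds gives the average regret. By the standard connection between external regret and coarse correlated equilibrium, this bounds how far the empirical distribution of play is from such an equilibrium, which is how a poly(log T) regret bound translates into the Õ(1/T) rate.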

research
08/20/2022

Near-Optimal Φ-Regret Learning in Extensive-Form Games

In this paper, we establish efficient and uncoupled learning dynamics so...
research
06/08/2020

Hedging in games: Faster convergence of external and swap regrets

We consider the setting where players run the Hedge algorithm or its opt...
research
03/22/2022

Optimistic Mirror Descent Either Converges to Nash or to Strong Coarse Correlated Equilibria in Bimatrix Games

We show that, for any sufficiently small fixed ϵ > 0, when both players ...
research
11/11/2021

Near-Optimal No-Regret Learning for Correlated Equilibria in Multi-Player General-Sum Games

Recently, Daskalakis, Fishelson, and Golowich (DFG) (NeurIPS'21) showed ...
research
09/30/2019

Strategizing against No-regret Learners

How should a player who repeatedly plays a game against a no-regret lear...
research
01/13/2021

On Misspecification in Prediction Problems and Robustness via Improper Learning

We study probabilistic prediction games when the underlying model is mis...
research
11/29/2021

Optimal No-Regret Learning in General Games: Bounded Regret with Unbounded Step-Sizes via Clairvoyant MWU

In this paper we solve the problem of no-regret learning in general game...
