No-regret learning and mixed Nash equilibria: They do not mix

10/19/2020
by   Lampros Flokas, et al.
0

Understanding the behavior of no-regret dynamics in general N-player games is a fundamental question in online learning and game theory. A folk result in the field states that, in finite games, the empirical frequency of play under no-regret learning converges to the game's set of coarse correlated equilibria. By contrast, our understanding of how the day-to-day behavior of the dynamics correlates to the game's Nash equilibria is much more limited, and only partial results are known for certain classes of games (such as zero-sum or congestion games). In this paper, we study the dynamics of "follow-the-regularized-leader" (FTRL), arguably the most well-studied class of no-regret dynamics, and we establish a sweeping negative result showing that the notion of mixed Nash equilibrium is antithetical to no-regret learning. Specifically, we show that any Nash equilibrium which is not strict (in that every player has a unique best response) cannot be stable and attracting under the dynamics of FTRL. This result has significant implications for predicting the outcome of a learning process as it shows unequivocally that only strict (and hence, pure) Nash equilibria can emerge as stable limit points thereof.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/12/2021

Survival of the strictest: Stable and unstable equilibria under regularized learning with partial information

In this paper, we examine the Nash equilibrium convergence properties of...
research
03/26/2022

Nash, Conley, and Computation: Impossibility and Incompleteness in Game Dynamics

Under what conditions do the behaviors of players, who play a game repea...
research
02/05/2023

Learning in quantum games

In this paper, we introduce a class of learning dynamics for general qua...
research
04/23/2015

Strategic Teaching and Learning in Games

It is known that there are uncoupled learning heuristics leading to Nash...
research
06/19/2023

Taming the Exponential Action Set: Sublinear Regret and Fast Convergence to Nash Equilibrium in Online Congestion Games

The congestion game is a powerful model that encompasses a range of engi...
research
12/03/2020

On the Impossibility of Convergence of Mixed Strategies with No Regret Learning

We study convergence properties of the mixed strategies that result from...
research
02/16/2021

Follow-the-Regularized-Leader Routes to Chaos in Routing Games

We study the emergence of chaotic behavior of Follow-the-Regularized Lea...

Please sign up or login with your details

Forgot password? Click here to reset