On the Impossibility of Convergence of Mixed Strategies with No Regret Learning

12/03/2020
by Vidya Muthukumar, et al.

We study convergence properties of the mixed strategies that result from a general class of optimal no-regret learning strategies in a repeated game setting where the stage game is any 2 × 2 competitive game (i.e., a game for which all the Nash equilibria (NE) are completely mixed). We consider the class of strategies whose information set at each step is the empirical average of the opponent's realized play (and the step number), which we call mean-based strategies. We first show that there does not exist any optimal no-regret, mean-based strategy for player 1 that would result in the convergence of her mixed strategies (in probability) against an opponent that plays his Nash equilibrium mixed strategy at each step. Next, we show that this last-iterate divergence necessarily occurs if player 2 uses any adaptive strategy with a minimal randomness property. This property is satisfied, for example, by any fixed sequence of mixed strategies for player 2 that converges to NE. We conjecture that this property holds when both players use optimal no-regret learning strategies against each other, leading to the divergence of the mixed strategies with positive probability. Finally, we show that variants of mean-based strategies using recency bias, which have yielded last-iterate convergence in deterministic min-max optimization, continue to lead to this last-iterate divergence. This demonstrates a crucial difference in outcomes between using the opponent's mixtures and using the opponent's realizations to make strategy updates.
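The setting above can be explored numerically with a small simulation. The sketch below is only an illustration under stated assumptions, not the paper's construction: it takes matching pennies as an example of a 2 × 2 competitive game, and it uses a Hedge-style (exponential weights) update with step size 1/sqrt(t) as one assumed instance of a mean-based strategy, since its output depends only on the empirical average of the opponent's realized play and the step number. The names `PAYOFF`, `mean_based_hedge`, and `simulate` are hypothetical. Player 2 plays his NE mixed strategy (1/2, 1/2) at every step.

```python
import math
import random

# Player 1's payoffs in matching pennies (assumed example game).
# Rows: player 1's action, columns: player 2's action.
# The unique NE is completely mixed: both players play (1/2, 1/2).
PAYOFF = [[1.0, -1.0],
          [-1.0, 1.0]]


def mean_based_hedge(opp_freq, t):
    """Probability that player 1 plays action 0 after t observed rounds.

    The only inputs are opp_freq (empirical frequency with which the
    opponent has played action 0) and t, so the rule is mean-based.
    The step size 1/sqrt(t) is an assumed, illustrative choice.
    """
    if t == 0:
        return 0.5  # no information yet: play uniformly
    # Cumulative payoff each action would have earned against the
    # opponent's realized play, recovered from the empirical average.
    cum = [t * (PAYOFF[a][0] * opp_freq + PAYOFF[a][1] * (1.0 - opp_freq))
           for a in (0, 1)]
    eta = 1.0 / math.sqrt(t)
    # Two-action exponential weights written as a logistic function.
    return 1.0 / (1.0 + math.exp(eta * (cum[1] - cum[0])))


def simulate(horizon, seed=0):
    """Return player 1's mixed strategy (probability of action 0) per step."""
    rng = random.Random(seed)
    count0 = 0  # number of times the opponent has played action 0
    strategies = []
    for t in range(horizon):
        freq = count0 / t if t > 0 else 0.5
        strategies.append(mean_based_hedge(freq, t))
        # Player 2 draws his action from the NE mixed strategy (1/2, 1/2).
        if rng.random() < 0.5:
            count0 += 1
    return strategies


if __name__ == "__main__":
    path = simulate(5000, seed=0)
    # Print a few late iterates to inspect whether they settle near 1/2.
    for t in (1000, 2000, 3000, 4000, 4999):
        print(t, round(path[t], 3))
```

In this sketch the exponent is proportional to sqrt(t) times the deviation of the opponent's empirical frequency from 1/2, a quantity whose fluctuations do not shrink as t grows. Printing the late iterates for several seeds gives an informal view of the last-iterate behavior the abstract describes; it is not a substitute for the paper's proof.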

