Fictitious Play: Convergence, Smoothness, and Optimism

11/19/2019
by Jacob Abernethy, et al.

We consider the dynamics of two-player zero-sum games, with the goal of understanding when such dynamics lead to equilibrium behavior at a fast rate. In particular, we study the dynamic known as fictitious play (FP), in which each player simultaneously best-responds to the empirical distribution of the opponent's historical plays. Nearly 70 years ago, Robinson showed that FP converges to a Nash equilibrium, although the rate she proved was exponential in the total number of actions of the players. In 1959, Karlin conjectured that FP converges at the more natural rate of O(1/ϵ^2). However, Daskalakis and Pan disproved a version of this conjecture in 2014, showing that an exponentially slow rate can occur, although their result relied on adversarial tie-breaking. In this paper, we show that Karlin's conjecture is indeed correct in two major instances when ties are handled appropriately. First, we show that if the game matrix is diagonal and ties are broken lexicographically, then FP converges at an O(1/ϵ^2) rate, and we also show a matching lower bound under this tie-breaking assumption. Second, we show that FP converges at a rate of O(1/ϵ^2) when the players' decision sets are smooth, and at a rate of Õ(1/ϵ) under an additional assumption. In this last case, we also show that a modification of FP, known as Optimistic FP, converges at a rate of O(1/ϵ).
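To make the FP dynamic concrete, here is a minimal Python sketch of simultaneous fictitious play on a matrix game. It is an illustration under stated assumptions, not the authors' implementation: the function name `fictitious_play`, the convention that the row player minimizes and the column player maximizes, and the choice that both players start with action 0 are all assumptions made here. Ties are broken toward the lowest index (the behavior of `np.argmin` and `np.argmax`), used as a stand-in for the lexicographic tie-breaking mentioned in the abstract.

```python
import numpy as np

def fictitious_play(A, rounds):
    """Run simultaneous fictitious play on the zero-sum matrix game A.

    Assumed convention: the row player minimizes x^T A y and the
    column player maximizes it. Returns the empirical mixed strategies
    (x, y) and their duality gap after `rounds` >= 1 rounds.
    """
    A = np.asarray(A, dtype=float)
    n, m = A.shape
    row_counts = np.zeros(n)  # how often each row action was played
    col_counts = np.zeros(m)  # how often each column action was played
    i, j = 0, 0               # arbitrary first actions (an assumption)
    for _ in range(rounds):
        row_counts[i] += 1
        col_counts[j] += 1
        # Each player best-responds to the opponent's empirical play.
        # argmin/argmax return the first optimal index, so ties go to
        # the lowest-numbered action.
        i = int(np.argmin(A @ col_counts))
        j = int(np.argmax(row_counts @ A))
    x = row_counts / rounds
    y = col_counts / rounds
    # Duality gap: best column payoff against x minus best row payoff
    # against y. It is zero exactly at a Nash equilibrium.
    gap = float(np.max(x @ A) - np.min(A @ y))
    return x, y, gap

if __name__ == "__main__":
    # A diagonal game (the identity matrix), as in the first result.
    x, y, gap = fictitious_play(np.eye(2), 2000)
    print("row strategy:", x)
    print("column strategy:", y)
    print("duality gap:", gap)
```

In this sketch, a convergence rate of O(1/ϵ^2) corresponds to the duality gap falling below ϵ after on the order of 1/ϵ^2 rounds.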

research
10/12/2021

Intelligent Players in a Fictitious Play Framework

Fictitious play is a popular learning algorithm in which players that ut...
research
03/19/2023

Instance-dependent Sample Complexity Bounds for Zero-sum Matrix Games

We study the sample complexity of identifying an approximate equilibrium...
research
11/30/2019

Smooth Fictitious Play in N×2 Potential Games

The paper shows that smooth fictitious play converges to a neighborhood ...
research
03/03/2023

Can We Find Nash Equilibria at a Linear Rate in Markov Games?

We study decentralized learning in two-player zero-sum discounted Markov...
research
03/26/2022

Robust No-Regret Learning in Min-Max Stackelberg Games

The behavior of no-regret learning algorithms is well understood in two-...
research
07/15/2021

EPTAS for stable allocations in matching games

Gale-Shapley introduced a matching problem between two sets of agents wh...
research
07/11/2019

Proportional Dynamics in Exchange Economies

We study the Proportional Response dynamic in exchange economies, where ...
