Anticipatory Fictitious Play

12/20/2022
by   Alex Cloud, et al.
0

Fictitious play is an algorithm for computing Nash equilibria of matrix games. Recently, machine learning variants of fictitious play have been successfully applied to complicated real-world games. This paper presents a simple modification of fictitious play which is a strict improvement over the original: it has the same theoretical worst-case convergence rate, is equally applicable in a machine learning context, and enjoys superior empirical performance. We conduct an extensive comparison of our algorithm with fictitious play, proving an optimal convergence rate for certain classes of games, demonstrating superior performance numerically across a variety of games, and concluding with experiments that extend these algorithms to the setting of deep multiagent reinforcement learning.

READ FULL TEXT
research
03/03/2016

Deep Reinforcement Learning from Self-Play in Imperfect-Information Games

Many real-world applications can be described as large-scale games of im...
research
07/25/2017

On the Exponential Rate of Convergence of Fictitious Play in Potential Games

The paper studies fictitious play (FP) learning dynamics in continuous t...
research
06/01/2021

Gradient Play in Multi-Agent Markov Stochastic Games: Stationary Points and Convergence

We study the performance of the gradient play algorithm for multi-agent ...
research
12/20/2022

Adapting the Exploration Rate for Value-of-Information-Based Reinforcement Learning

In this paper, we consider the problem of adjusting the exploration rate...
research
06/08/2020

Hedging in games: Faster convergence of external and swap regrets

We consider the setting where players run the Hedge algorithm or its opt...
research
06/19/2022

The Power of Regularization in Solving Extensive-Form Games

In this paper, we investigate the power of regularization, a common tech...
research
05/19/2021

Modeling Precomputation In Games Played Under Computational Constraints

Understanding the properties of games played under computational constra...

Please sign up or login with your details

Forgot password? Click here to reset