Regularization for Strategy Exploration in Empirical Game-Theoretic Analysis

02/09/2023
by   Yongzhao Wang, et al.
0

In iterative approaches to empirical game-theoretic analysis (EGTA), the strategy space is expanded incrementally based on analysis of intermediate game models. A common approach to strategy exploration, represented by the double oracle algorithm, is to add strategies that best-respond to a current equilibrium. This approach may suffer from overfitting and other limitations, leading the developers of the policy-space response oracle (PSRO) framework for iterative EGTA to generalize the target of best response, employing what they term meta-strategy solvers (MSSs). Noting that many MSSs can be viewed as perturbed or approximated versions of Nash equilibrium, we adopt an explicit regularization perspective to the specification and analysis of MSSs. We propose a novel MSS called regularized replicator dynamics (RRD), which simply truncates the process based on a regret criterion. We show that RRD is more adaptive than existing MSSs and outperforms them in various games. We extend our study to three-player games, for which the payoff matrix is cubic in the number of strategies and so exhaustively evaluating profiles may not be feasible. We propose a profile search method that can identify solutions from incomplete models, and combine this with iterative model construction using a regularized MSS. Finally, and most importantly, we reveal that the regret of best response targets has a tremendous influence on the performance of strategy exploration through experiments, which provides an explanation for the effectiveness of regularization in PSRO.

READ FULL TEXT
research
12/02/2021

Empirical Game-Theoretic Analysis in Mean Field Games

We present a simulation-based approach for solution of mean field games ...
research
05/21/2021

Evaluating Strategy Exploration in Empirical Game-Theoretic Analysis

In empirical game-theoretic analysis (EGTA), game models are extended it...
research
01/19/2022

Anytime PSRO for Two-Player Zero-Sum Games

Policy space response oracles (PSRO) is a multi-agent reinforcement lear...
research
01/12/2021

Survival of the strictest: Stable and unstable equilibria under regularized learning with partial information

In this paper, we examine the Nash equilibrium convergence properties of...
research
01/20/2023

Computing equilibria by minimizing exploitability with best-response ensembles

In this paper, we study the problem of computing an approximate Nash equ...
research
02/01/2023

Combining Tree-Search, Generative Models, and Nash Bargaining Concepts in Game-Theoretic Reinforcement Learning

Multiagent reinforcement learning (MARL) has benefited significantly fro...
research
02/02/2023

Exploiting Extensive-Form Structure in Empirical Game-Theoretic Analysis

Empirical game-theoretic analysis (EGTA) is a general framework for reas...

Please sign up or login with your details

Forgot password? Click here to reset