Reduced Space and Faster Convergence in Imperfect-Information Games via Regret-Based Pruning

09/12/2016
by   Noam Brown, et al.
0

Counterfactual Regret Minimization (CFR) is the most popular iterative algorithm for solving zero-sum imperfect-information games. Regret-Based Pruning (RBP) is an improvement that allows poorly-performing actions to be temporarily pruned, thus speeding up CFR. We introduce Total RBP, a new form of RBP that reduces the space requirements of CFR as actions are pruned. We prove that in zero-sum games it asymptotically prunes any action that is not part of a best response to some Nash equilibrium. This leads to provably faster convergence and lower space requirements. Experiments show that Total RBP results in an order of magnitude reduction in space, and the reduction factor increases with game size.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/06/2020

Solving imperfect-information games via exponential counterfactual regret minimization

Two agents' decision-making problems can be modeled as the game with two...
research
06/05/2020

Sparsified Linear Programming for Zero-Sum Equilibrium Finding

Computational equilibrium finding in large zero-sum extensive-form imper...
research
01/30/2020

Fictitious Play Outperforms Counterfactual Regret Minimization

We compare the performance of two popular iterative algorithms, fictitio...
research
05/18/2021

CFR-MIX: Solving Imperfect Information Extensive-Form Games with Combinatorial Action Space

In many real-world scenarios, a team of agents coordinate with each othe...
research
04/22/2021

Optimize Neural Fictitious Self-Play in Regret Minimization Thinking

Optimization of deep learning algorithms to approach Nash Equilibrium re...
research
04/24/2019

Solving zero-sum extensive-form games with arbitrary payoff uncertainty models

Modeling strategic conflict from a game theoretical perspective involves...
research
07/27/2018

Acceleration through Optimistic No-Regret Dynamics

We consider the problem of minimizing a smooth convex function by reduci...

Please sign up or login with your details

Forgot password? Click here to reset