Equivalence Analysis between Counterfactual Regret Minimization and Online Mirror Descent

10/11/2021
by   Weiming Liu, et al.
1

Counterfactual Regret Minimization (CFR) is a kind of regret minimization algorithm that minimizes the total regret by minimizing the local counterfactual regrets. CFRs have a fast convergence rate in practice and they have been widely used for solving large-scale imperfect-information Extensive-form games (EFGs). However, due to their locality, CFRs are difficult to analyze and extend. Follow-the-Regularized-Lead (FTRL) and Online Mirror Descent (OMD) algorithms are regret minimization algorithms in Online Convex Optimization. They are mathematically elegant but less practical in solving EFGs. In this paper, we provide a new way to analyze and extend CFRs, by proving that CFR with Regret Matching and CFR with Regret Matching+ are special forms of FTRL and OMD, respectively. With these equivalences, two new algorithms, which can be considered as the extensions of vanilla CFR and CFR+, are deduced from the perspective of FTRL and OMD. In these two variants, maintaining the local counterfactual regrets is not necessary anymore. The experiments show that the two variants converge faster than vanilla CFR and CFR+ in some EFGs.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/28/2020

Faster Game Solving via Predictive Blackwell Approachability: Connecting Regret Matching and Mirror Descent

Blackwell approachability is a framework for reasoning about repeated ga...
research
05/24/2023

Regret Matching+: (In)Stability and Fast Convergence in Games

Regret Matching+ (RM+) and its variants are important algorithms for sol...
research
09/10/2018

Online Convex Optimization for Sequential Decision Processes and Extensive-Form Games

Regret minimization is a powerful tool for solving large-scale extensive...
research
09/11/2018

Solving Imperfect-Information Games via Discounted Regret Minimization

Counterfactual regret minimization (CFR) is a family of iterative algori...
research
12/20/2021

Fast Algorithms for Poker Require Modelling it as a Sequential Bayesian Game

Many recent results in imperfect information games were only formulated ...
research
05/27/2021

Conic Blackwell Algorithm: Parameter-Free Convex-Concave Saddle-Point Solving

We develop new parameter and scale-free algorithms for solving convex-co...
research
07/20/2020

Unlocking the Potential of Deep Counterfactual Value Networks

Deep counterfactual value networks combined with continual resolving pro...

Please sign up or login with your details

Forgot password? Click here to reset