Monte Carlo Continual Resolving for Online Strategy Computation in Imperfect Information Games

12/18/2018
by Michal Šustr, et al.

Online game-playing algorithms produce high-quality strategies with a fraction of the memory and computation required by their offline alternatives. Continual Resolving (CR) is a recent, theoretically sound approach to online game playing that has been used to outperform human professionals in poker. However, parts of the algorithm were specific to poker, which enjoys many properties not shared by other imperfect information games. We present a domain-independent formulation of CR that is applicable to any two-player zero-sum extensive-form game and works with an abstract resolving algorithm. We further describe and implement its Monte Carlo variant (MCCR), which uses Monte Carlo Counterfactual Regret Minimization (MCCFR) as a resolver. We prove the correctness of CR and show an O(T^{-1/2}) dependence of MCCR's exploitability on the computation time T. Furthermore, we empirically compare MCCR with incremental tree building against Online Outcome Sampling and Information-set MCTS on several domains.
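As a rough illustration of what an O(T^{-1/2}) exploitability rate looks like, the following minimal sketch runs plain regret matching (the building block underlying CFR-style algorithms) in self-play on a small matrix game and measures the exploitability of the average strategy. It is not MCCR or MCCFR and is not taken from the paper: the 2x2 payoff matrix, the function names `regret_matching` and `exploitability_after`, and the use of exact expected values instead of sampling are all assumptions made for illustration.

```python
import math

# Hypothetical 2x2 zero-sum game: entries are the row player's payoffs,
# the column player receives the negation. Payoffs lie in [-1, 2].
PAYOFF = [[2.0, -1.0], [-1.0, 1.0]]


def regret_matching(cum_regret):
    """Play each action in proportion to its positive cumulative regret."""
    positive = [max(r, 0.0) for r in cum_regret]
    total = sum(positive)
    n = len(cum_regret)
    return [p / total for p in positive] if total > 0 else [1.0 / n] * n


def exploitability_after(iterations):
    """Run regret-matching self-play and return the exploitability
    (sum of both players' best-response gains) of the average strategies."""
    n_rows, n_cols = len(PAYOFF), len(PAYOFF[0])
    regret_row, regret_col = [0.0] * n_rows, [0.0] * n_cols
    sum_row, sum_col = [0.0] * n_rows, [0.0] * n_cols

    for _ in range(iterations):
        x = regret_matching(regret_row)
        y = regret_matching(regret_col)

        # Expected payoff of every pure action against the opponent's current mix.
        row_values = [sum(PAYOFF[i][j] * y[j] for j in range(n_cols))
                      for i in range(n_rows)]
        col_values = [-sum(PAYOFF[i][j] * x[i] for i in range(n_rows))
                      for j in range(n_cols)]
        row_ev = sum(x[i] * row_values[i] for i in range(n_rows))
        col_ev = sum(y[j] * col_values[j] for j in range(n_cols))

        # Accumulate regrets and the strategies that were played.
        for i in range(n_rows):
            regret_row[i] += row_values[i] - row_ev
            sum_row[i] += x[i]
        for j in range(n_cols):
            regret_col[j] += col_values[j] - col_ev
            sum_col[j] += y[j]

    x_avg = [s / iterations for s in sum_row]
    y_avg = [s / iterations for s in sum_col]

    # Best-response value of the row player against y_avg, and the value the
    # column player can hold the row player to against x_avg.
    best_row = max(sum(PAYOFF[i][j] * y_avg[j] for j in range(n_cols))
                   for i in range(n_rows))
    best_col = min(sum(PAYOFF[i][j] * x_avg[i] for i in range(n_rows))
                   for j in range(n_cols))
    return best_row - best_col


if __name__ == "__main__":
    for t in (100, 1_000, 10_000):
        gap = exploitability_after(t)
        print(t, gap, gap * math.sqrt(t))
```

In this toy setting the standard regret-matching bound keeps the product of the exploitability and sqrt(T) below a constant (here 2 * 3 * sqrt(2), from two players, a payoff range of 3, and two actions each), which is the same kind of rate the abstract states for MCCR.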


research
05/14/2020

Competing in a Complex Hidden Role Game with Information Set Monte Carlo Tree Search

Advances in intelligent game playing agents have led to successes in per...
research
12/27/2018

Double Neural Counterfactual Regret Minimization

Counterfactual Regret Minimization (CFR) is a fundamental and effective ...
research
09/27/2017

Combining Prediction of Human Decisions with ISMCTS in Imperfect Information Games

Monte Carlo Tree Search (MCTS) has been extended to many imperfect infor...
research
07/22/2019

Low-Variance and Zero-Variance Baselines for Extensive-Form Games

Extensive-form games (EFGs) are a common model of multi-agent interactio...
research
01/13/2020

Donald Duck Holiday Game: A numerical analysis of a Game of the Goose role-playing variant

The 1996 Donald Duck Holiday Game is a role-playing variant of the histo...
research
01/18/2014

Learning to Win by Reading Manuals in a Monte-Carlo Framework

Domain knowledge is crucial for effective performance in autonomous cont...
research
09/04/2023

Pure Monte Carlo Counterfactual Regret Minimization

Counterfactual Regret Minimization (CFR) and its variants are the best a...
