Finding and Certifying (Near-)Optimal Strategies in Black-Box Extensive-Form Games

09/15/2020
by   Brian Hu Zhang, et al.
0

Often – for example in war games, strategy video games, and financial simulations – the game is given to us only as a black-box simulator in which we can play it. In these settings, since the game may have unknown nature action distributions (from which we can only obtain samples) and/or be too large to expand fully, it can be difficult to compute strategies with guarantees on exploitability. Recent work <cit.> resulted in a notion of certificate for extensive-form games that allows exploitability guarantees while not expanding the full game tree. However, that work assumed that the black box could sample or expand arbitrary nodes of the game tree at any time, and that a series of exact game solves (via, for example, linear programming) can be conducted to compute the certificate. Each of those two assumptions severely restricts the practical applicability of that method. In this work, we relax both of the assumptions. We show that high-probability certificates can be obtained with a black box that can do nothing more than play through games, using only a regret minimizer as a subroutine. As a bonus, we obtain an equilibrium-finding algorithm with Õ(1/√(T)) convergence rate in the extensive-form game setting that does not rely on a sampling strategy with lower-bounded reach probabilities (which MCCFR assumes). We demonstrate experimentally that, in the black-box setting, our methods are able to provide nontrivial exploitability guarantees while expanding only a small fraction of the game tree.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/29/2020

Small Nash Equilibrium Certificates in Very Large Games

In many game settings, the game is not explicitly given but is only acce...
research
02/01/2022

Kernelized Multiplicative Weights for 0/1-Polyhedral Games: Bridging the Gap Between Learning in Extensive-Form and Normal-Form Games

While extensive-form games (EFGs) can be converted into normal-form game...
research
03/08/2021

Model-Free Online Learning in Unknown Sequential Decision Making Problems and Games

Regret minimization has proved to be a versatile tool for tree-form sequ...
research
11/18/2019

Learning Probably Approximately Correct Maximin Strategies in Simulation-Based Games with Infinite Strategy Spaces

We tackle the problem of learning equilibria in simulation-based games. ...
research
04/27/2018

Approximating Nash Equilibria for Black-Box Games: A Bayesian Optimization Approach

Game theory has emerged as a powerful framework for modeling a large ran...
research
03/05/2018

A Genetic Programming Framework for 2D Platform AI

There currently exists a wide range of techniques to model and evolve ar...
research
09/11/2021

Team Correlated Equilibria in Zero-Sum Extensive-Form Games via Tree Decompositions

Despite the many recent practical and theoretical breakthroughs in compu...

Please sign up or login with your details

Forgot password? Click here to reset