AIVAT: A New Variance Reduction Technique for Agent Evaluation in Imperfect Information Games

12/20/2016
by   Neil Burch, et al.
0

Evaluating agent performance when outcomes are stochastic and agents use randomized strategies can be challenging when there is limited data available. The variance of sampled outcomes may make the simple approach of Monte Carlo sampling inadequate. This is the case for agents playing heads-up no-limit Texas hold'em poker, where man-machine competitions have involved multiple days of consistent play and still not resulted in statistically significant conclusions even when the winner's margin is substantial. In this paper, we introduce AIVAT, a low variance, provably unbiased value assessment tool that uses an arbitrary heuristic estimate of state value, as well as the explicit strategy of a subset of the agents. Unlike existing techniques which reduce the variance from chance events, or only consider game ending actions, AIVAT reduces the variance both from choices by nature and by players with a known strategy. The resulting estimator in no-limit poker can reduce the number of hands needed to draw statistical conclusions by more than a factor of 10.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/22/2019

Low-Variance and Zero-Variance Baselines for Extensive-Form Games

Extensive-form games (EFGs) are a common model of multi-agent interactio...
research
05/21/2018

Depth-Limited Solving for Imperfect-Information Games

A fundamental challenge in imperfect-information games is that states do...
research
04/24/2017

Evaluating and Modelling Hanabi-Playing Agents

Agent modelling involves considering how other agents will behave, in or...
research
09/09/2018

Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines

Learning strategies for imperfect information games from samples of inte...
research
12/29/2020

Knowledge-Based Strategies for Multi-Agent Teams Playing Against Nature

We study teams of agents that play against Nature towards achieving a co...
research
03/29/2019

MCTS-based Automated Negotiation Agent (Extended Abstract)

This paper introduces a new Negotiating Agent for automated negotiation ...
research
10/05/2016

Sampled Fictitious Play is Hannan Consistent

Fictitious play is a simple and widely studied adaptive heuristic for pl...

Please sign up or login with your details

Forgot password? Click here to reset