Estimating α-Rank by Maximizing Information Gain

01/22/2021
by   Tabish Rashid, et al.
7

Game theory has been increasingly applied in settings where the game is not known outright, but has to be estimated by sampling. For example, meta-games that arise in multi-agent evaluation can only be accessed by running a succession of expensive experiments that may involve simultaneous deployment of several agents. In this paper, we focus on α-rank, a popular game-theoretic solution concept designed to perform well in such scenarios. We aim to estimate the α-rank of the game using as few samples as possible. Our algorithm maximizes information gain between an epistemic belief over the α-ranks and the observed payoff. This approach has two main benefits. First, it allows us to focus our sampling on the entries that matter the most for identifying the α-rank. Second, the Bayesian formulation provides a facility to build in modeling assumptions by using a prior over game payoffs. We show the benefits of using information gain as compared to the confidence interval criterion of ResponseGraphUCB (Rowland et al. 2019), and provide theoretical results justifying our method.

READ FULL TEXT

page 7

page 11

page 12

page 14

page 15

page 16

page 17

research
03/16/2018

A Generalised Method for Empirical Game Theoretic Analysis

This paper provides theoretical bounds for empirical game theoretical an...
research
03/31/2019

Rank Reduction in Bimatrix Games

The rank of a bimatrix game is defined as the rank of the sum of the pay...
research
04/04/2022

The Parking Problem: A Game-Theoretic Solution

In this paper, we propose a game-theoretic solution to the parking probl...
research
02/07/2018

From Game-theoretic Multi-agent Log Linear Learning to Reinforcement Learning

Multi-agent Systems (MASs) have found a variety of industrial applicatio...
research
10/22/2020

A Multilinear Sampling Algorithm to Estimate Shapley Values

Shapley values are great analytical tools in game theory to measure the ...
research
09/21/2019

Multiagent Evaluation under Incomplete Information

This paper investigates the evaluation of learned multiagent strategies ...
research
06/15/2021

Plane and Sample: Maximizing Information about Autonomous Vehicle Performance using Submodular Optimization

As autonomous vehicles (AVs) take on growing Operational Design Domains ...

Please sign up or login with your details

Forgot password? Click here to reset