What game are we playing? End-to-end learning in normal and extensive form games

05/07/2018
by   Chun Kai Ling, et al.
0

Although recent work in AI has made great progress in solving large, zero-sum, extensive-form games, the underlying assumption in most past work is that the parameters of the game itself are known to the agents. This paper deals with the relatively under-explored but equally important "inverse" setting, where the parameters of the underlying game are not known to all agents, but must be learned through observations. We propose a differentiable, end-to-end learning framework for addressing this task. In particular, we consider a regularized version of the game, equivalent to a particular form of quantal response equilibrium, and develop 1) a primal-dual Newton method for finding such equilibrium points in both normal and extensive form games; and 2) a backpropagation method that lets us analytically compute gradients of all relevant game parameters through the solution itself. This ultimately lets us learn the game by training in an end-to-end fashion, effectively by integrating a "differentiable game solver" into the loop of larger deep network architectures. We demonstrate the effectiveness of the learning method in several settings including poker and security game tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/11/2019

Large Scale Learning of Agent Rationality in Two-Player Zero-Sum Games

With the recent advances in solving large, zero-sum extensive form games...
research
06/29/2020

Small Nash Equilibrium Certificates in Very Large Games

In many game settings, the game is not explicitly given but is only acce...
research
07/26/2023

Beyond Strict Competition: Approximate Convergence of Multi Agent Q-Learning Dynamics

The behaviour of multi-agent learning in competitive settings is often c...
research
04/27/2023

Conditional dominance in games with unawareness

Heifetz, Meier, and Schipper (2013) introduced dynamic game with unaware...
research
03/20/2023

Convergence analysis and acceleration of the smoothing methods for solving extensive-form games

The extensive-form game has been studied considerably in recent years. I...
research
08/24/2023

SC-PSRO: A Unified Strategy Learning Method for Normal-form Games

Solving Nash equilibrium is the key challenge in normal-form games with ...
research
03/01/2017

OptNet: Differentiable Optimization as a Layer in Neural Networks

This paper presents OptNet, a network architecture that integrates optim...

Please sign up or login with your details

Forgot password? Click here to reset