Rethinking Formal Models of Partially Observable Multiagent Decision Making

06/26/2019
by   Vojtěch Kovařík, et al.
5

Multiagent decision-making problems in partially observable environments are usually modeled as either extensive-form games (EFGs) within the game theory community or partially observable stochastic games (POSGs) within the reinforcement learning community. While most practical problems can be modeled in both formalisms, the communities using these models are mostly distinct with little sharing of ideas or advances. The last decade has seen dramatic progress in algorithms for EFGs, mainly driven by the challenge problem of poker. We have seen computational techniques achieving super-human performance, some variants of poker are essentially solved, and there are now sound local search algorithms which were previously thought impossible. While the advances have garnered attention, the fundamental advances are not yet understood outside the EFG community. This can be largely explained by the starkly different formalisms between the game theory and reinforcement learning communities and, further, by the unsuitability of the original EFG formalism to make the ideas simple and clear. This paper aims to address these hindrances, by advocating a new unifying formalism, a variant of POSGs, which we call Factored-Observation Games (FOGs). We prove that any timeable perfect-recall EFG can be efficiently modeled as a FOG as well as relating FOGs to other existing formalisms. Additionally, a FOG explicitly identifies the public and private components of observations, which is fundamental to the recent EFG breakthroughs. We conclude by presenting the two building-blocks of these breakthroughs — counterfactual regret minimization and public state decomposition — in the new formalism, illustrating our goal of a simpler path for sharing recent advances between game theory and reinforcement learning community.

READ FULL TEXT

page 15

page 22

research
11/15/2021

The Partially Observable History Process

We introduce the partially observable history process (POHP) formalism f...
research
06/14/2019

Problems with the EFG formalism: a solution attempt using observations

We argue that the extensive-form game (EFG) model isn't powerful enough ...
research
04/17/2018

On Improving Deep Reinforcement Learning for POMDPs

Deep Reinforcement Learning (RL) recently emerged as one of the most com...
research
08/26/2019

OpenSpiel: A Framework for Reinforcement Learning in Games

OpenSpiel is a collection of environments and algorithms for research in...
research
08/07/2014

Learning to Cooperate via Policy Search

Cooperative games are those in which both agents share the same payoff s...
research
02/05/2020

Partially Observable Games for Secure Autonomy

Technology development efforts in autonomy and cyber-defense have been e...
research
05/18/2021

Learning and Information in Stochastic Networks and Queues

We review the role of information and learning in the stability and opti...

Please sign up or login with your details

Forgot password? Click here to reset