On Bellman's Optimality Principle for zs-POSGs

06/29/2020
by   Olivier Buffet, et al.
0

Many non-trivial sequential decision-making problems are efficiently solved by relying on Bellman's optimality principle, i.e., exploiting the fact that sub-problems are nested recursively within the original problem. Here we show how it can apply to (infinite horizon) 2-player zero-sum partially observable stochastic games (zs-POSGs) by (i) taking a central planner's viewpoint, which can only reason on a sufficient statistic called occupancy state, and (ii) turning such problems into zero-sum occupancy Markov games (zs-OMGs). Then, exploiting the Lipschitz-continuity of the value function in occupancy space, one can derive a version of the HSVI algorithm (Heuristic Search Value Iteration) that provably finds an ϵ-Nash equilibrium in finite time.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/26/2022

HSVI can solve zero-sum Partially Observable Stochastic Games

State-of-the-art methods for solving 2-player zero-sum imperfect informa...
research
10/25/2021

HSVI fo zs-POSGs using Concavity, Convexity and Lipschitz Properties

Dynamic programming and heuristic search are at the core of state-of-the...
research
06/22/2016

Structure in the Value Function of Two-Player Zero-Sum Games of Incomplete Information

Zero-sum stochastic games provide a rich model for competitive decision ...
research
10/21/2020

Solving Zero-Sum One-Sided Partially Observable Stochastic Games

Many security and other real-world situations are dynamic in nature and ...
research
07/13/2023

Multi-Player Zero-Sum Markov Games with Networked Separable Interactions

We study a new class of Markov games (MGs), Multi-player Zero-sum Markov...
research
02/25/2020

On Reinforcement Learning for Turn-based Zero-sum Markov Games

We consider the problem of finding Nash equilibrium for two-player turn-...
research
07/16/2020

Polyhedral value iteration for discounted games and energy games

We present a deterministic algorithm, solving discounted games with n no...

Please sign up or login with your details

Forgot password? Click here to reset