Solving infinite-horizon Dec-POMDPs using Finite State Controllers within JESP

09/17/2021
by   Yang You, et al.
0

This paper looks at solving collaborative planning problems formalized as Decentralized POMDPs (Dec-POMDPs) by searching for Nash equilibria, i.e., situations where each agent's policy is a best response to the other agents' (fixed) policies. While the Joint Equilibrium-based Search for Policies (JESP) algorithm does this in the finite-horizon setting relying on policy trees, we propose here to adapt it to infinite-horizon Dec-POMDPs by using finite state controller (FSC) policy representations. In this article, we (1) explain how to turn a Dec-POMDP with N-1 fixed FSCs into an infinite-horizon POMDP whose solution is an N^th agent best response; (2) propose a JESP variant, called , using this to solve infinite-horizon Dec-POMDPs; (3) introduce heuristic initializations for JESP aiming at leading to good solutions; and (4) conduct experiments on state-of-the-art benchmark problems to evaluate our approach.

READ FULL TEXT
research
02/08/2021

Last-iterate Convergence of Decentralized Optimistic Gradient Descent/Ascent in Infinite-horizon Competitive Markov Games

We study infinite-horizon discounted two-player zero-sum Markov games, a...
research
05/19/2023

Monte-Carlo Search for an Equilibrium in Dec-POMDPs

Decentralized partially observable Markov decision processes (Dec-POMDPs...
research
06/13/2012

Sparse Stochastic Finite-State Controllers for POMDPs

Bounded policy iteration is an approach to solving infinite-horizon POMD...
research
05/01/2015

Stick-Breaking Policy Learning in Dec-POMDPs

Expectation maximization (EM) has recently been shown to be an efficient...
research
03/15/2012

Anytime Planning for Decentralized POMDPs using Expectation Maximization

Decentralized POMDPs provide an expressive framework for multi-agent seq...
research
08/16/2023

Eliciting Risk Aversion with Inverse Reinforcement Learning via Interactive Questioning

This paper proposes a novel framework for identifying an agent's risk av...
research
11/04/2021

Infinite Time Horizon Safety of Bayesian Neural Networks

Bayesian neural networks (BNNs) place distributions over the weights of ...

Please sign up or login with your details

Forgot password? Click here to reset