Online Planning for Decentralized Stochastic Control with Partial History Sharing

08/06/2019
by Kaiqing Zhang, et al.

In decentralized stochastic control, standard approaches to sequential decision-making, such as dynamic programming, quickly become intractable because they must maintain a complex information state. The computational challenges are compounded when agents lack complete knowledge of the model. In this paper, we exploit the fact that in many problems agents share some common information, or history, an information structure termed partial history sharing. Under this structure, the policy search space is greatly reduced. We propose a provably convergent, online tree-search-based algorithm that requires neither a closed-form model nor explicit communication among agents. Interestingly, our algorithm can be viewed as a generalization of several existing heuristic solvers for decentralized partially observable Markov decision processes (Dec-POMDPs). To demonstrate the applicability of the model, we propose a novel collaborative intrusion response model in which multiple agents (defenders) with asymmetric information aim to collaboratively defend a computer network. Numerical results demonstrate the performance of our algorithm.

research
10/16/2012

Scaling Up Decentralized MDPs Through Heuristic Search

Decentralized partially observable Markov decision processes (Dec-POMDPs...
research
01/16/2013

The Complexity of Decentralized Control of Markov Decision Processes

Planning for distributed agents with partial state information is consid...
research
05/19/2023

Monte-Carlo Search for an Equilibrium in Dec-POMDPs

Decentralized partially observable Markov decision processes (Dec-POMDPs...
research
02/26/2019

Information Gathering in Decentralized POMDPs by Policy Graph Improvement

Decentralized policies for information gathering are required when multi...
research
07/04/2012

MAA*: A Heuristic Search Algorithm for Solving Decentralized POMDPs

We present multi-agent A* (MAA*), the first complete and optimal heurist...
research
06/30/2011

Decentralized Control of Cooperative Systems: Categorization and Complexity Analysis

Decentralized control of cooperative systems captures the operation of a...
research
08/01/2011

Exploiting Agent and Type Independence in Collaborative Graphical Bayesian Games

Efficient collaborative decision making is an important challenge for mu...
