Distributed Planning in Hierarchical Factored MDPs

12/12/2012
by Carlos E. Guestrin, et al.

We present a principled and efficient planning algorithm for collaborative multiagent dynamical systems. All computation, during both the planning and the execution phases, is distributed among the agents; each agent only needs to model and plan for a small part of the system. Each of these local subsystems is small, but once they are combined they can represent an exponentially larger problem. The subsystems are connected through a subsystem hierarchy. Coordination and communication between the agents is not imposed, but derived directly from the structure of this hierarchy. A globally consistent plan is achieved by a message passing algorithm, where messages correspond to natural local reward functions and are computed by local linear programs; another message passing algorithm allows us to execute the resulting policy. When two portions of the hierarchy share the same structure, our algorithm can reuse plans and messages to speed up computation.
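The size gap between the local subsystems and the combined problem can be made concrete with a small sketch. It is only an illustration of the counting argument, not the paper's algorithm: the function names are hypothetical, and it assumes every subsystem has the same number of local states and that the joint state space is the full Cartesian product of the local ones.

```python
from itertools import product


def local_model_size(num_subsystems: int, states_per_subsystem: int) -> int:
    """Number of local states modeled in total when each agent
    only represents its own subsystem (grows linearly)."""
    return num_subsystems * states_per_subsystem


def joint_model_size(num_subsystems: int, states_per_subsystem: int) -> int:
    """Number of states in the combined system, assuming the joint
    state is one local state per subsystem (grows exponentially)."""
    return states_per_subsystem ** num_subsystems


def enumerate_joint_states(num_subsystems: int, states_per_subsystem: int):
    """Lazily enumerate the joint states as tuples of local state indices.
    Only practical for very small systems, which is the point."""
    return product(range(states_per_subsystem), repeat=num_subsystems)


if __name__ == "__main__":
    # Hypothetical sizes chosen for illustration only.
    for n in (2, 5, 10, 20):
        print(n, local_model_size(n, 4), joint_model_size(n, 4))
```

Under these assumptions, ten subsystems with four local states each amount to 40 local states in total, while a flat planner would face 4^10 joint states. This is why planning over local models with coordination messages can be far cheaper than planning over the combined system.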


