A Variational Perturbative Approach to Planning in Graph-based Markov Decision Processes

12/04/2019
by   Dominik Linzner, et al.

Coordinating multiple interacting agents to achieve a common goal is a difficult task with broad applicability. The problem remains hard to solve even when interactions are restricted to those mediated by a static interaction graph. We present a novel approximate solution method for multi-agent Markov decision problems on graphs, based on variational perturbation theory. We adopt the strategy of planning via inference, which has been explored in various prior works. We employ a non-trivial extension of a novel high-order variational method that allows for approximate inference in large networks and has been shown to surpass the accuracy of existing variational methods. To compare our method with two state-of-the-art methods for multi-agent planning on graphs, we apply it to different standard GMDP problems. We show that in cases where the goal is encoded as a non-local cost function, our method performs well, while state-of-the-art methods approach the performance of random guessing. In a final experiment, we demonstrate that our method brings significant improvement for synchronization tasks.

research
02/06/2013

Fast Value Iteration for Goal-Directed Markov Decision Processes

Planning problems where effects of actions are non-deterministic can be ...
research
06/04/2016

Effective Multi-Robot Spatial Task Allocation using Model Approximations

Real-world multi-agent planning problems cannot be solved using decision...
research
04/10/2023

A Novel Point-based Algorithm for Multi-agent Control Using the Common Information Approach

The Common Information (CI) approach provides a systematic way to transf...
research
07/31/2023

Distributed Dynamic Programming for Networked Multi-Agent Markov Decision Processes

The main goal of this paper is to investigate distributed dynamic progra...
research
05/04/2021

Polynomial-Time Algorithms for Multi-Agent Minimal-Capacity Planning

We study the problem of minimizing the resource capacity of autonomous a...
research
01/23/2019

Robust temporal difference learning for critical domains

We present a new Q-function operator for temporal difference (TD) learni...
research
08/14/2017

Motion Planning under Partial Observability using Game-Based Abstraction

We study motion planning problems where agents move inside environments ...
