Leveraging Statistical Multi-Agent Online Planning with Emergent Value Function Approximation

04/17/2018
by   Thomy Phan, et al.
0

Making decisions is a great challenge in distributed autonomous environments due to enormous state spaces and uncertainty. Many online planning algorithms rely on statistical sampling to avoid searching the whole state space, while still being able to make acceptable decisions. However, planning often has to be performed under strict computational constraints making online planning in multi-agent systems highly limited, which could lead to poor system performance, especially in stochastic domains. In this paper, we propose Emergent Value function Approximation for Distributed Environments (EVADE), an approach to integrate global experience into multi-agent online planning in stochastic domains to consider global effects during local planning. For this purpose, a value function is approximated online based on the emergent system behaviour by using methods of reinforcement learning. We empirically evaluated EVADE with two statistical multi-agent online planning algorithms in a highly complex and stochastic smart factory environment, where multiple agents need to process various items at a shared set of machines. Our experiments show that EVADE can effectively improve the performance of multi-agent online planning while offering efficiency w.r.t. the breadth and depth of the planning process.

READ FULL TEXT
research
01/25/2019

Distributed Policy Iteration for Scalable Approximation of Cooperative Multi-Agent Policies

Decision making in multi-agent systems (MAS) is a great challenge due to...
research
01/04/2020

Represented Value Function Approach for Large Scale Multi Agent Reinforcement Learning

In this paper, we consider the problem of large scale multi agent reinfo...
research
02/24/2017

Scalable Multiagent Coordination with Distributed Online Open Loop Planning

We propose distributed online open loop planning (DOOLP), a general fram...
research
11/22/2019

Multi-Objective Multi-Agent Planning for Jointly Discovering and Tracking Mobile Object

We consider the challenging problem of online planning for a team of age...
research
03/09/2022

Multi-Objective Multi-Agent Planning for Discovering and Tracking Unknown and Varying Number of Mobile Objects

We consider the online planning problem for a team of agents to discover...
research
03/22/2021

Reward-Reinforced Reinforcement Learning for Multi-agent Systems

Reinforcement learning algorithms in multi-agent systems deliver highly ...
research
12/19/2019

Uncertainty-sensitive Learning and Planning with Ensembles

We propose a reinforcement learning framework for discrete environments ...

Please sign up or login with your details

Forgot password? Click here to reset