PRIMAL: Pathfinding via Reinforcement and Imitation Multi-Agent Learning

09/10/2018
by   Guillaume Sartoretti, et al.
0

Multi-agent path finding (MAPF) is an essential component of many large-scale, real-world robot deployments, from aerial swarms to warehouse automation to collaborative search-and-rescue. However, despite the community's continued efforts, most state-of-the-art MAPF algorithms still rely on centralized planning, and scale poorly past a few hundred agents. Such planning approaches are maladapted to real-world deployment, where noise and uncertainty often require paths to be recomputed or adapted online, which is impossible when planning times are in seconds to minutes. In this work, we present PRIMAL, a novel framework for MAPF that combines reinforcement and imitation learning to teach fully-decentralized policies, where agents reactively plan paths online in a partially-observable world while exhibiting implicit coordination. Our framework extends our previous works on distributed learning of collaborative policies by introducing demonstrations of an expert MAPF algorithm during training, as well as careful reward shaping and environment sampling. Once learned, the resulting policy can be copied onto any number of agents, and naturally scales to different team sizes and world dimensions. We present results on randomized worlds with up to 1024 agents and compare success rates against state-of-the-art MAPF algorithms. Finally, we show experimental results of the learned policies in a hybrid simulation of a factory mock-up, involving both real-world and simulated robots.

READ FULL TEXT
research
10/16/2020

PRIMAL2: Pathfinding via Reinforcement and Imitation Multi-Agent Learning – Lifelong

Multi-agent path finding (MAPF) is an indispensable component of large-s...
research
01/04/2020

Multi-Agent Interactions Modeling with Correlated Policies

In multi-agent systems, complex interacting behaviors arise due to the h...
research
06/15/2020

ForMIC: Foraging via Multiagent RL with Implicit Communication

Multi-agent foraging (MAF) involves distributing a team of agents to sea...
research
11/04/2020

Moving Forward in Formation: A Decentralized Hierarchical Learning Approach to Multi-Agent Moving Together

Multi-agent path finding in formation has many potential real-world appl...
research
02/08/2022

Multi-Agent Path Finding with Prioritized Communication Learning

Multi-agent pathfinding (MAPF) has been widely used to solve large-scale...
research
09/19/2017

Learning of Coordination Policies for Robotic Swarms

Inspired by biological swarms, robotic swarms are envisioned to solve re...
research
05/25/2021

From Motor Control to Team Play in Simulated Humanoid Football

Intelligent behaviour in the physical world exhibits structure at multip...

Please sign up or login with your details

Forgot password? Click here to reset