Intent-based Deep Reinforcement Learning for Multi-agent Informative Path Planning

03/09/2023
by Tianze Yang, et al.

In multi-agent informative path planning (MAIPP), agents must collectively construct a global belief map of an underlying distribution of interest (e.g., gas concentration, light intensity, or pollution levels) over a given domain, based on measurements taken along their trajectories. They must frequently replan their paths to balance the distributed exploration of new areas against the collective, meticulous exploitation of known high-interest areas, so as to maximize the information gained within a predefined budget (e.g., path length or working time). A common approach to achieving such cooperation relies on planning the agents' paths reactively, conditioned on other agents' predicted future actions. However, as the agents' beliefs are updated continuously, these predicted future actions may not end up being the ones the agents actually execute, which introduces a form of noise/inaccuracy into the system and often decreases performance. In this work, we propose a decentralized deep reinforcement learning (DRL) approach to MAIPP, which relies on an attention-based neural network, where agents optimize long-term individual and cooperative objectives by explicitly sharing their intent (i.e., a distribution over their medium-/long-term future positions, obtained from their individual policy) in a reactive, asynchronous manner. That is, in our work, intent sharing allows agents to learn to claim/avoid broader areas of the world. Moreover, since our approach relies on learned attention over these shared intents, agents can learn to recognize the useful portion(s) of these (imperfect) predictions and maximize cooperation even in the presence of imperfect information. Our experiments demonstrate the performance of our approach relative to its variants and high-quality baselines over a large set of MAIPP simulations.
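To make the notion of intent more concrete, the following is a minimal sketch of one way a distribution over future positions could be estimated from a stochastic policy: by sampling short rollouts and normalizing visit counts. This is an illustration under stated assumptions, not the method from the paper. The toy ring graph `GRAPH`, the `uniform_policy` stand-in, and the function `estimate_intent` are all hypothetical names introduced here; the paper's agents use a learned attention-based policy instead.

```python
import random
from collections import Counter

# Hypothetical toy domain: a 4-node ring graph (not from the paper).
GRAPH = {0: [1, 3], 1: [0, 2], 2: [1, 3], 3: [2, 0]}


def uniform_policy(node):
    """Stand-in for a learned policy: returns neighbors and their probabilities."""
    neighbors = GRAPH[node]
    return neighbors, [1.0 / len(neighbors)] * len(neighbors)


def estimate_intent(policy, start, horizon, num_rollouts=200, seed=0):
    """Estimate a distribution over positions visited within `horizon` steps.

    Samples `num_rollouts` trajectories from `policy`, counts every node
    visited after the start, and normalizes the counts to sum to 1.
    """
    rng = random.Random(seed)
    visits = Counter()
    for _ in range(num_rollouts):
        node = start
        for _ in range(horizon):
            neighbors, probs = policy(node)
            node = rng.choices(neighbors, weights=probs, k=1)[0]
            visits[node] += 1
    total = sum(visits.values())
    return {node: count / total for node, count in visits.items()}


# An agent at node 0 could share this map with others as its intent.
intent = estimate_intent(uniform_policy, start=0, horizon=3)
```

Other agents could then treat a map like `intent` as a soft claim over regions of the domain, which is the kind of signal a learned attention mechanism might weigh when deciding where to plan next.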

research
01/27/2023

ARiADNE: A Reinforcement learning approach using Attention-based Deep Networks for Exploration

In autonomous robot exploration tasks, a mobile robot needs to actively ...
research
09/09/2021

DAN: Decentralized Attention-based Neural Network to Solve the MinMax Multiple Traveling Salesman Problem

The multiple traveling salesman problem (mTSP) is a well-known NP-hard p...
research
10/04/2021

Multi-Agent Path Planning Using Deep Reinforcement Learning

In this paper a deep reinforcement based multi-agent path planning appro...
research
07/30/2020

MAPPER: Multi-Agent Path Planning with Evolutionary Reinforcement Learning in Mixed Dynamic Environments

Multi-agent navigation in dynamic environments is of great industrial va...
research
03/21/2022

Long Short-Term Memory for Spatial Encoding in Multi-Agent Path Planning

Reinforcement learning-based path planning for multi-agent systems of va...
research
10/19/2017

Consequentialist conditional cooperation in social dilemmas with imperfect information

Social dilemmas, where mutual cooperation can lead to high payoffs but p...
research
08/03/2021

Predictive Runtime Monitoring for Mobile Robots using Logic-Based Bayesian Intent Inference

We propose a predictive runtime monitoring framework that forecasts the ...
