A sub-modular receding horizon solution for mobile multi-agent persistent monitoring

08/12/2019
by   Navid Rezazadeh, et al.
0

We study the problem of persistent monitoring of finite number of inter-connected geographical nodes for event detection via a group of heterogeneous mobile agents. We use Poisson process model to capture the probability of the events occurring at the geographical nodes. We then tie a utility function to the probability of detecting an event in each point of interest and use it in our policy design to incentivize the agents to visit the geographical nodes with higher probability of event occurrence. We show that the design of an optimal monitoring policy to maximize the utility of event detection over a mission horizon is an NP-hard problem. By showing that the reward function is a monotone increasing and submodular function, we then proceed to propose a suboptimal dispatch policy design with a known optimality gap. To reduce the time complexity of constructing the feasible search set and also to induce robustness to changes in event occurrence and other operational factors, we preform our suboptimal policy design in a receding horizon setting. Our next contribution is to add a new term to our optimization problem to compensate for the shortsightedness of the receding horizon approach. This added term provides a measure of importance for nodes beyond the receding horizon's sight, and is meant to give the policy design an intuition to steer the agents towards areas with higher importance on the global map. Finally, we discuss how our proposed algorithm can be implemented in a decentralized manner. We demonstrate our results through a simulation study.

READ FULL TEXT
research
11/30/2020

Multi-Agent Maximization of a Monotone Submodular Function via Maximum Consensus

Constrained submodular set function maximization problems often appear i...
research
07/29/2021

Distributed Strategy Selection: A Submodular Set Function Maximization Approach

Constrained submodular set function maximization problems often appear i...
research
01/06/2022

Admissible Policy Teaching through Reward Design

We study reward design strategies for incentivizing a reinforcement lear...
research
11/02/2020

Multi-Agent Reinforcement Learning for Persistent Monitoring

The Persistent Monitoring (PM) problem seeks to find a set of trajectori...
research
05/30/2022

Designing Rewards for Fast Learning

To convey desired behavior to a Reinforcement Learning (RL) agent, a des...
research
01/20/2022

Invasion Dynamics in the Biased Voter Process

The voter process is a classic stochastic process that models the invasi...
research
05/25/2021

Bayesian Nonparametric Reinforcement Learning in LTE and Wi-Fi Coexistence

With the formation of next generation wireless communication, a growing ...

Please sign up or login with your details

Forgot password? Click here to reset