Multi-Agent Reinforcement Learning for Dynamic Ocean Monitoring by a Swarm of Buoys

12/21/2020
by   Maryam Kouzehgar, et al.
0

Autonomous marine environmental monitoring problem traditionally encompasses an area coverage problem which can only be effectively carried out by a multi-robot system. In this paper, we focus on robotic swarms that are typically operated and controlled by means of simple swarming behaviors obtained from a subtle, yet ad hoc combination of bio-inspired strategies. We propose a novel and structured approach for area coverage using multi-agent reinforcement learning (MARL) which effectively deals with the non-stationarity of environmental features. Specifically, we propose two dynamic area coverage approaches: (1) swarm-based MARL, and (2) coverage-range-based MARL. The former is trained using the multi-agent deep deterministic policy gradient (MADDPG) approach whereas, a modified version of MADDPG is introduced for the latter with a reward function that intrinsically leads to a collective behavior. Both methods are tested and validated with different geometric shaped regions with equal surface area (square vs. rectangle) yielding acceptable area coverage, and benefiting from the structured learning in non-stationary environments. Both approaches are advantageous compared to a naïve swarming method. However, coverage-range-based MARL outperforms the swarm-based MARL with stronger convergence features in learning criteria and higher spreading of agents for area coverage.

READ FULL TEXT

page 1

page 3

page 5

page 7

research
07/17/2023

Non-Stationary Policy Learning for Multi-Timescale Multi-Agent Reinforcement Learning

In multi-timescale multi-agent reinforcement learning (MARL), agents int...
research
05/10/2019

Emergent Escape-based Flocking Behavior using Multi-Agent Reinforcement Learning

In nature, flocking or swarm behavior is observed in many species as it ...
research
04/17/2023

Control and Coordination of a SWARM of Unmanned Surface Vehicles using Deep Reinforcement Learning in ROS

An unmanned surface vehicle (USV) can perform complex missions by contin...
research
02/14/2022

Efficient quantitative assessment of robot swarms: coverage and targeting Lévy strategies

Biologically inspired strategies have long been adapted to swarm robotic...
research
11/11/2022

Efficient Domain Coverage for Vehicles with Second Order Dynamics via Multi-Agent Reinforcement Learning

Collaborative autonomous multi-agent systems covering a specified area h...
research
07/18/2017

Energy-Efficient Strip Monitoring by Identical Devices Directed to One Side Along the Strip and Having a Coverage Area in the Form of a Sector

In this paper, we propose several regular covers with identical sectors ...
research
08/31/2018

Gradual Collective Upgrade of a Swarm of Autonomous Buoys for Dynamic Ocean Monitoring

Swarms of autonomous surface vehicles equipped with environmental sensor...

Please sign up or login with your details

Forgot password? Click here to reset