
Decentralized Multi-Agent Pursuit using Deep Reinforcement Learning

by Cristino de Souza Jr, et al.

Pursuit-evasion is the problem of capturing mobile targets with one or more pursuers. We use deep reinforcement learning to pursue an omni-directional target with multiple homogeneous agents that are subject to unicycle kinematic constraints. We use shared experience to train a policy for a given number of pursuers; at run-time, each agent executes that policy independently. Training benefits from three ingredients: curriculum learning, a sweeping-angle ordering that locally represents neighboring agents, and a reward structure that combines individual and group rewards to encourage good formations. Simulated experiments with a reactive evader and up to eight pursuers show that our learning-based approach, with non-holonomic agents, performs on par with classical algorithms that use omni-directional agents, and outperforms their non-holonomic adaptations. The learned policy is successfully transferred to the real world in a proof-of-concept demonstration with three motion-constrained pursuer drones.
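
To make two of the terms in the abstract concrete, here is a minimal Python sketch. It is not taken from the paper: the function names, the Euler integration step, and the choice to sort neighbors counter-clockwise by bearing relative to the agent's heading are all assumptions made for illustration. The first function shows the standard unicycle model, in which an agent can only move along its heading and turn. The second shows one plausible reading of a sweeping-angle ordering, which gives each agent a consistent local ordering of its neighbors.

```python
import math


def unicycle_step(x, y, theta, v, omega, dt):
    """Advance a unicycle-model agent by one explicit Euler step.

    v is the forward speed and omega the turn rate; the agent cannot
    move sideways, which is the non-holonomic constraint.
    (Hypothetical helper, not the paper's simulator.)
    """
    x_new = x + v * math.cos(theta) * dt
    y_new = y + v * math.sin(theta) * dt
    # Wrap the heading into [-pi, pi).
    theta_new = (theta + omega * dt + math.pi) % (2 * math.pi) - math.pi
    return x_new, y_new, theta_new


def order_neighbors_by_bearing(pose, neighbors):
    """Sort neighbor positions by bearing relative to the agent's heading.

    Assumption: the sweep starts at the agent's heading and proceeds
    counter-clockwise over [0, 2*pi). The paper may define it differently.
    """
    x, y, theta = pose

    def bearing(position):
        angle = math.atan2(position[1] - y, position[0] - x) - theta
        return angle % (2 * math.pi)

    return sorted(neighbors, key=bearing)
```

For example, an agent at the origin facing along the x-axis with neighbors at (0, -1), (1, 1), and (-1, 0) would order them as (1, 1), (-1, 0), (0, -1) under this assumed convention, regardless of the order in which they were observed.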


Deep Reinforcement Learning for Swarm Systems

Recently, deep reinforcement learning (RL) methods have been applied suc...

PowerNet: Multi-agent Deep Reinforcement Learning for Scalable Powergrid Control

This paper develops an efficient multi-agent deep reinforcement learning...

A novel approach for multi-agent cooperative pursuit to capture grouped evaders

An approach of mobile multi-agent pursuit based on application of self-o...

Towards the Use of Deep Reinforcement Learning with Global Policy For Query-based Extractive Summarisation

Supervised approaches for text summarisation suffer from the problem of ...

Prosocial learning agents solve generalized Stag Hunts better than selfish ones

Deep reinforcement learning has become an important paradigm for constru...

Graph-based Cluttered Scene Generation and Interactive Exploration using Deep Reinforcement Learning

We introduce a novel method to teach a robotic agent to interactively ex...

Learning to flock through reinforcement

Flocks of birds, schools of fish, insects swarms are examples of coordin...