Sub-optimal Policy Aided Multi-Agent Reinforcement Learning for Flocking Control

09/17/2022
by   Yunbo Qiu, et al.
0

Flocking control is a challenging problem, where multiple agents, such as drones or vehicles, need to reach a target position while maintaining the flock and avoiding collisions with obstacles and collisions among agents in the environment. Multi-agent reinforcement learning has achieved promising performance in flocking control. However, methods based on traditional reinforcement learning require a considerable number of interactions between agents and the environment. This paper proposes a sub-optimal policy aided multi-agent reinforcement learning algorithm (SPA-MARL) to boost sample efficiency. SPA-MARL directly leverages a prior policy that can be manually designed or solved with a non-learning method to aid agents in learning, where the performance of the policy can be sub-optimal. SPA-MARL recognizes the difference in performance between the sub-optimal policy and itself, and then imitates the sub-optimal policy if the sub-optimal policy is better. We leverage SPA-MARL to solve the flocking control problem. A traditional control method based on artificial potential fields is used to generate a sub-optimal policy. Experiments demonstrate that SPA-MARL can speed up the training process and outperform both the MARL baseline and the used sub-optimal policy.

READ FULL TEXT
research
09/27/2019

Interaction-Aware Multi-Agent Reinforcement Learning for Mobile Agents with Individual Goals

In a multi-agent setting, the optimal policy of a single agent is largel...
research
08/25/2023

Towards Optimal Head-to-head Autonomous Racing with Curriculum Reinforcement Learning

Head-to-head autonomous racing is a challenging problem, as the vehicle ...
research
01/30/2013

An Anytime Algorithm for Decision Making under Uncertainty

We present an anytime algorithm which computes policies for decision pro...
research
08/05/2020

Learning Power Control from a Fixed Batch of Data

We address how to exploit power control data, gathered from a monitored ...
research
03/08/2022

Policy Regularization for Legible Behavior

In Reinforcement Learning interpretability generally means to provide in...
research
04/05/2023

Constrained Exploration in Reinforcement Learning with Optimality Preservation

We consider a class of reinforcement-learning systems in which the agent...
research
05/08/2023

Sense, Imagine, Act: Multimodal Perception Improves Model-Based Reinforcement Learning for Head-to-Head Autonomous Racing

Model-based reinforcement learning (MBRL) techniques have recently yield...

Please sign up or login with your details

Forgot password? Click here to reset