Multi-vehicle Flocking Control with Deep Deterministic Policy Gradient Method

06/01/2018
by   Yang Lyu, et al.
0

Flocking control has been studied extensively along with the wide application of multi-vehicle systems. In this paper the Multi-vehicles System (MVS) flocking control with collision avoidance and communication preserving is considered based on the deep reinforcement learning framework. Specifically the deep deterministic policy gradient (DDPG) with centralized training and distributed execution process is implemented to obtain the flocking control policy. First, to avoid the dynamically changed observation of state, a three layers tensor based representation of the observation is used so that the state remains constant although the observation dimension is changing. A reward function is designed to guide the way-points tracking, collision avoidance and communication preserving. The reward function is augmented by introducing the local reward function of neighbors. Finally, a centralized training process which trains the shared policy based on common training set among all agents. The proposed method is tested under simulated scenarios with different setup.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/20/2022

Reinforcement learning reward function in unmanned aerial vehicle control tasks

This paper presents a new reward function that can be used for deep rein...
research
08/04/2023

Vehicles Control: Collision Avoidance using Federated Deep Reinforcement Learning

In the face of growing urban populations and the escalating number of ve...
research
02/08/2017

Autonomous Braking System via Deep Reinforcement Learning

In this paper, we propose a new autonomous braking system based on deep ...
research
08/05/2021

Deep Reinforcement Learning for Continuous Docking Control of Autonomous Underwater Vehicles: A Benchmarking Study

Docking control of an autonomous underwater vehicle (AUV) is a task that...
research
02/24/2021

Hybrid Car-Following Strategy based on Deep Deterministic Policy Gradient and Cooperative Adaptive Cruise Control

Deep deterministic policy gradient (DDPG) based car-following strategy c...
research
02/21/2021

Accelerated Sim-to-Real Deep Reinforcement Learning: Learning Collision Avoidance from Human Player

This paper presents a sensor-level mapless collision avoidance algorithm...
research
10/12/2022

Smooth Trajectory Collision Avoidance through Deep Reinforcement Learning

Collision avoidance is a crucial task in vision-guided autonomous naviga...

Please sign up or login with your details

Forgot password? Click here to reset