Deep Reinforcement Learning based Local Planner for UAV Obstacle Avoidance using Demonstration Data

08/06/2020
by   Lei He, et al.
0

In this paper, a deep reinforcement learning (DRL) method is proposed to address the problem of UAV navigation in an unknown environment. However, DRL algorithms are limited by the data efficiency problem as they typically require a huge amount of data before they reach a reasonable performance. To speed up the DRL training process, we developed a novel learning framework which combines imitation learning and reinforcement learning and building upon Twin Delayed DDPG (TD3) algorithm. We newly introduced both policy and Q-value network are learned using the expert demonstration during the imitation phase. To tackle the distribution mismatch problem transfer from imitation to reinforcement learning, both TD-error and decayed imitation loss are used to update the pre-trained network when start interacting with the environment. The performances of the proposed algorithm are demonstrated on the challenging 3D UAV navigation problem using depth cameras and sketched in a variety of simulation environments.

READ FULL TEXT

page 1

page 6

research
10/27/2019

BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning

The field of Deep Reinforcement Learning (DRL) has recently seen a surge...
research
02/08/2021

Towards Hierarchical Task Decomposition using Deep Reinforcement Learning for Pick and Place Subtasks

Deep Reinforcement Learning (DRL) is emerging as a promising approach to...
research
12/12/2018

Learning with Training Wheels: Speeding up Training with a Simple Controller for Deep Reinforcement Learning

Deep Reinforcement Learning (DRL) has been applied successfully to many ...
research
03/13/2023

Sim-to-Real Deep Reinforcement Learning based Obstacle Avoidance for UAVs under Measurement Uncertainty

Deep Reinforcement Learning is quickly becoming a popular method for tra...
research
09/08/2022

Optimal mesh generation for a blade passage using deep reinforcement learning

A mesh generation method that can generate an optimal mesh for a blade p...
research
09/02/2021

Reinforcement Learning for Battery Energy Storage Dispatch augmented with Model-based Optimizer

Reinforcement learning has been found useful in solving optimal power fl...
research
08/04/2022

DL-DRL: A double-layer deep reinforcement learning approach for large-scale task scheduling of multi-UAV

This paper studies deep reinforcement learning (DRL) for the task schedu...

Please sign up or login with your details

Forgot password? Click here to reset