Accelerated Sim-to-Real Deep Reinforcement Learning: Learning Collision Avoidance from Human Player

02/21/2021
by   Hanlin Niu, et al.
7

This paper presents a sensor-level mapless collision avoidance algorithm for use in mobile robots that map raw sensor data to linear and angular velocities and navigate in an unknown environment without a map. An efficient training strategy is proposed to allow a robot to learn from both human experience data and self-exploratory data. A game format simulation framework is designed to allow the human player to tele-operate the mobile robot to a goal and human action is also scored using the reward function. Both human player data and self-playing data are sampled using prioritized experience replay algorithm. The proposed algorithm and training strategy have been evaluated in two different experimental configurations: Environment 1, a simulated cluttered environment, and Environment 2, a simulated corridor environment, to investigate the performance. It was demonstrated that the proposed method achieved the same level of reward using only 16% of the training steps required by the standard Deep Deterministic Policy Gradient (DDPG) method in Environment 1 and 20% of that in Environment 2. In the evaluation of 20 random missions, the proposed method achieved no collision in less than 2 h and 2.5 h of training time in the two Gazebo environments respectively. The method also generated smoother trajectories than DDPG. The proposed method has also been implemented on a real robot in the real-world environment for performance evaluation. We can confirm that the trained model with the simulation software can be directly applied into the real-world scenario without further fine-tuning, further demonstrating its higher robustness than DDPG. The video and code are available: https://youtu.be/BmwxevgsdGc https://github.com/hanlinniu/turtlebot3_ddpg_collision_avoidance

READ FULL TEXT

page 1

page 4

page 5

page 6

research
08/03/2023

Avoidance Navigation Based on Offline Pre-Training Reinforcement Learning

This paper presents a Pre-Training Deep Reinforcement Learning(DRL) for ...
research
02/10/2020

On Reward Shaping for Mobile Robot Navigation: A Reinforcement Learning and SLAM Based Approach

We present a map-less path planning algorithm based on Deep Reinforcemen...
research
08/11/2018

Fully Distributed Multi-Robot Collision Avoidance via Deep Reinforcement Learning for Safe and Efficient Navigation in Complex Scenarios

In this paper, we present a decentralized sensor-level collision avoidan...
research
09/28/2017

Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning

Developing a safe and efficient collision avoidance policy for multiple ...
research
05/28/2020

Deep Reinforcement learning for real autonomous mobile robot navigation in indoor environments

Deep Reinforcement Learning has been successfully applied in various com...
research
02/18/2019

DIViS: Domain Invariant Visual Servoing for Collision-Free Goal Reaching

Robots should understand both semantics and physics to be functional in ...
research
06/01/2018

Multi-vehicle Flocking Control with Deep Deterministic Policy Gradient Method

Flocking control has been studied extensively along with the wide applic...

Please sign up or login with your details

Forgot password? Click here to reset