Benchmarking Reinforcement Learning Techniques for Autonomous Navigation

by Zifan Xu, et al.

Deep reinforcement learning (RL) has brought many successes for autonomous robot navigation. However, important limitations still prevent real-world use of RL-based navigation systems. For example, most learning approaches lack safety guarantees, and learned navigation systems may not generalize well to unseen environments. Although a variety of recent learning techniques tackle these challenges in general, the lack of an open-source benchmark and of reproducible learning methods specifically for autonomous navigation makes it difficult for roboticists to choose which learning methods to use for their mobile robots, and for learning researchers to identify current shortcomings of general learning methods for autonomous navigation. In this paper, we identify four major desiderata for applying deep RL approaches to autonomous navigation: (D1) reasoning under uncertainty, (D2) safety, (D3) learning from limited trial-and-error data, and (D4) generalization to diverse and novel environments. Then, we explore four major classes of learning techniques, each intended to achieve one or more of the four desiderata: memory-based neural network architectures (D1), safe RL (D2), model-based RL (D2, D3), and domain randomization (D4). By deploying these learning techniques in a new open-source large-scale navigation benchmark and in real-world environments, we perform a comprehensive study aimed at establishing to what extent these techniques can achieve these desiderata for RL-based navigation systems.
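
The pairing of technique classes with desiderata described above can be written down as a small lookup table. The following is only an illustrative sketch: the mapping itself is taken from the abstract, while the names `DESIDERATA`, `TECHNIQUES`, and `techniques_for` are hypothetical and are not part of the authors' benchmark.

```python
from __future__ import annotations

# Desiderata as listed in the abstract (dictionary name is hypothetical).
DESIDERATA = {
    "D1": "reasoning under uncertainty",
    "D2": "safety",
    "D3": "learning from limited trial-and-error data",
    "D4": "generalization to diverse and novel environments",
}

# Technique classes and the desiderata each one targets, per the abstract.
TECHNIQUES = {
    "memory-based neural network architectures": ["D1"],
    "safe RL": ["D2"],
    "model-based RL": ["D2", "D3"],
    "domain randomization": ["D4"],
}


def techniques_for(desideratum: str) -> list[str]:
    """Return the technique classes the abstract associates with a desideratum."""
    if desideratum not in DESIDERATA:
        raise KeyError(f"unknown desideratum: {desideratum}")
    return [name for name, targets in TECHNIQUES.items() if desideratum in targets]


if __name__ == "__main__":
    for key, description in DESIDERATA.items():
        print(f"{key} ({description}): {', '.join(techniques_for(key))}")
```

Read this way, safety (D2) is the only desideratum addressed by two technique classes, and model-based RL is the only class aimed at more than one desideratum.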




