Ada-NAV: Adaptive Trajectory-Based Sample Efficient Policy Learning for Robotic Navigation

06/09/2023
by   Bhrij Patel, et al.
0

Reinforcement learning methods, while effective for learning robotic navigation strategies, are known to be highly sample inefficient. This sample inefficiency comes in part from not suitably balancing the explore-exploit dilemma, especially in the presence of non-stationarity, during policy optimization. To incorporate a balance of exploration-exploitation for sample efficiency, we propose Ada-NAV, an adaptive trajectory length scheme where the length grows as a policy's randomness, represented by its Shannon or differential entropy, decreases. Our adaptive trajectory length scheme emphasizes exploration at the beginning of training due to more frequent gradient updates and emphasizes exploitation later on with longer trajectories. In gridworld, simulated robotic environments, and real-world robotic experiments, we demonstrate the merits of the approach over constant and randomly sampled trajectory lengths in terms of performance and sample efficiency. For a fixed sample budget, Ada-NAV results in an 18 navigation success rate, a 20-38 9.32 other methods. We also demonstrate that Ada-NAV can be transferred and integrated into a Clearpath Husky robot without significant performance degradation.

READ FULL TEXT

page 2

page 6

page 7

research
06/17/2019

Robotic Navigation using Entropy-Based Exploration

Robotic navigation concerns the task in which a robot should be able to ...
research
07/23/2020

Deep Reinforcement Learning based Automatic Exploration for Navigation in Unknown Environment

This paper investigates the automatic exploration problem under the unkn...
research
04/15/2021

Rule-Based Reinforcement Learning for Efficient Robot Navigation with Space Reduction

For real-world deployments, it is critical to allow robots to navigate i...
research
01/19/2019

Lifelong Federated Reinforcement Learning: A Learning Architecture for Navigation in Cloud Robotic Systems

This paper was motivated by the problem of how to make robots fuse and t...
research
10/21/2019

Combining Benefits from Trajectory Optimization and Deep Reinforcement Learning

Recent breakthroughs both in reinforcement learning and trajectory optim...
research
06/18/2021

Sample Efficient Social Navigation Using Inverse Reinforcement Learning

In this paper, we present an algorithm to efficiently learn socially-com...
research
01/23/2021

Rethinking Exploration for Sample-Efficient Policy Learning

Off-policy reinforcement learning for control has made great strides in ...

Please sign up or login with your details

Forgot password? Click here to reset