Reinforcement Learning for Robotics and Control with Active Uncertainty Reduction

05/15/2019
by   Narendra Patwardhan, et al.

Model-free reinforcement learning methods such as Proximal Policy Optimization or Q-learning typically require thousands of interactions with the environment to approximate the optimal controller, which may not be feasible in robotics because of safety concerns and the time required. Model-based methods such as PILCO or Black-DROPS, while data-efficient, provide solutions with limited robustness and complexity. To address this tradeoff, we introduce virtual environments based on active uncertainty reduction, which are built from a limited number of trials conducted in the original environment. We provide an efficient method for uncertainty management, in which the model's uncertainty serves as a metric for self-improvement: adaptive sampling identifies the points with maximum expected improvement. Capturing the uncertainty also allows the virtual environment to better mimic the reward responses of the original system. Our approach enables the use of complex policy structures and reward functions through a combination of model-based and model-free methods while retaining data efficiency. We demonstrate the validity of our method on several classic reinforcement learning problems in OpenAI Gym. We prove that our approach offers better modeling capacity for complex system dynamics than established methods.
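The adaptive sampling idea in the abstract can be illustrated with a small sketch. The abstract does not say which surrogate model or acquisition rule the authors use, so everything here is an assumption: a zero-mean Gaussian process with a squared-exponential kernel stands in for the learned virtual environment, and the next trial is placed where the predictive standard deviation is largest, a simpler proxy for the "maximum expected improvement" criterion. The names `rbf_kernel`, `gp_posterior`, and `next_query` are hypothetical.

```python
import numpy as np


def rbf_kernel(a, b, length_scale=1.0):
    """Squared-exponential kernel between 1-D input arrays a and b."""
    diff = a[:, None] - b[None, :]
    return np.exp(-0.5 * (diff / length_scale) ** 2)


def gp_posterior(x_train, y_train, x_query, noise=1e-4, length_scale=1.0):
    """Posterior mean and standard deviation of a zero-mean GP at x_query."""
    k_tt = rbf_kernel(x_train, x_train, length_scale) + noise * np.eye(len(x_train))
    k_qt = rbf_kernel(x_query, x_train, length_scale)
    mean = k_qt @ np.linalg.solve(k_tt, y_train)
    # The prior variance is 1 for this kernel; subtract the part explained by data.
    explained = np.sum(k_qt * np.linalg.solve(k_tt, k_qt.T).T, axis=1)
    std = np.sqrt(np.clip(1.0 - explained, 0.0, None))
    return mean, std


def next_query(x_train, y_train, candidates, **kwargs):
    """Pick the candidate where the surrogate is most uncertain (assumed rule)."""
    _, std = gp_posterior(x_train, y_train, candidates, **kwargs)
    return candidates[int(np.argmax(std))]


# Toy data: three trials clustered on the left of a 1-D state range.
# sin(x) is an illustrative stand-in for the unknown system response.
x_train = np.array([-2.0, -1.5, -1.0])
y_train = np.sin(x_train)
candidates = np.linspace(-3.0, 3.0, 61)

x_next = next_query(x_train, y_train, candidates)
_, std_at_train = gp_posterior(x_train, y_train, x_train)
print("next trial at x =", x_next)
```

In this toy setup the selected point lies far from the existing trials, where the surrogate has seen no data. Repeating the select, run, and refit loop is one way uncertainty could be reduced with few real interactions; the method in the paper may differ in both the model and the acquisition criterion.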

research
11/28/2019

Deep Model-Based Reinforcement Learning via Estimated Uncertainty and Conservative Policy Optimization

Model-based reinforcement learning algorithms tend to achieve higher sam...
research
06/25/2019

Uncertainty-aware Model-based Policy Optimization

Model-based reinforcement learning has the potential to be more sample e...
research
07/17/2021

High-Accuracy Model-Based Reinforcement Learning, a Survey

Deep reinforcement learning has shown remarkable success in the past few...
research
06/03/2019

Proximal Reliability Optimization for Reinforcement Learning

Despite the numerous advances, reinforcement learning remains away from ...
research
01/28/2022

Joint Differentiable Optimization and Verification for Certified Reinforcement Learning

In model-based reinforcement learning for safety-critical control system...
research
05/21/2020

Guided Uncertainty-Aware Policy Optimization: Combining Learning and Model-Based Strategies for Sample-Efficient Policy Learning

Traditional robotic approaches rely on an accurate model of the environm...
research
10/10/2019

Efficient Intrinsically Motivated Robotic Grasping with Learning-Adaptive Imagination in Latent Space

Combining model-based and model-free deep reinforcement learning has sho...
