Residual Policy Learning

12/15/2018
by Tom Silver, et al.
We present Residual Policy Learning (RPL): a simple method for improving nondifferentiable policies using model-free deep reinforcement learning. RPL thrives in complex robotic manipulation tasks where good but imperfect controllers are available. In these tasks, reinforcement learning from scratch remains data-inefficient or intractable, but learning a residual on top of the initial controller can yield substantial improvement. We study RPL in five challenging MuJoCo tasks involving partial observability, sensor noise, model misspecification, and controller miscalibration. By combining learning with control algorithms, RPL can perform long-horizon, sparse-reward tasks for which reinforcement learning alone fails. Moreover, we find that RPL consistently and substantially improves on the initial controllers. We argue that RPL is a promising approach for combining the complementary strengths of deep reinforcement learning and robotic control, pushing the boundaries of what either can achieve independently.
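The core idea, learning a residual on top of an initial controller, can be illustrated with a minimal sketch. This is not the authors' implementation: the names `make_residual_policy`, `p_controller`, and `LinearResidual` are hypothetical, the additive combination of the two action vectors is an assumed reading of "residual", and the zero initialization of the residual is an assumption made here so that the combined policy starts out identical to the initial controller. The initial controller is only ever called, never differentiated, which is why it may be an arbitrary black-box function.

```python
from typing import Callable, List, Sequence

State = Sequence[float]
Action = List[float]


def make_residual_policy(
    initial_controller: Callable[[State], Action],
    residual: Callable[[State], Action],
) -> Callable[[State], Action]:
    """Combine a fixed controller with a learned correction (assumed additive)."""

    def policy(state: State) -> Action:
        base = initial_controller(state)   # black box: only evaluated, never differentiated
        delta = residual(state)            # learned correction to the base action
        return [b + d for b, d in zip(base, delta)]

    return policy


def p_controller(state: State) -> Action:
    """Hypothetical hand-designed controller: proportional push toward the origin."""
    return [-0.5 * x for x in state]


class LinearResidual:
    """Hypothetical stand-in for the learned residual (a deep network in practice)."""

    def __init__(self, dim: int) -> None:
        # Assumption: zero weights, so the residual outputs zero before training.
        self.weights = [[0.0] * dim for _ in range(dim)]

    def __call__(self, state: State) -> Action:
        return [sum(w * x for w, x in zip(row, state)) for row in self.weights]


residual = LinearResidual(dim=2)
policy = make_residual_policy(p_controller, residual)

# Before any learning, the combined policy reproduces the initial controller.
action_before = policy([1.0, -2.0])

# A model-free RL algorithm would update only the residual's parameters;
# here a single weight is changed by hand to stand in for such an update.
residual.weights[0][0] = 0.1
action_after = policy([1.0, -2.0])
```

In this sketch only `residual.weights` would be trained, so the reinforcement learner has to discover a correction to an already reasonable behavior rather than the whole behavior from scratch.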

research
07/21/2021

Bayesian Controller Fusion: Leveraging Control Priors in Deep Reinforcement Learning for Robotics

We present Bayesian Controller Fusion (BCF): a hybrid control strategy t...
research
11/24/2020

Achieving Sample-Efficient and Online-Training-Safe Deep Reinforcement Learning with Base Controllers

Application of Deep Reinforcement Learning (DRL) algorithms in real-worl...
research
01/30/2023

Transferring Multiple Policies to Hotstart Reinforcement Learning in an Air Compressor Management Problem

Many instances of similar or almost-identical industrial machines or too...
research
10/09/2018

Distributed Wildfire Surveillance with Autonomous Aircraft using Deep Reinforcement Learning

Teams of autonomous unmanned aircraft can be used to monitor wildfires, ...
research
05/03/2019

Deep Residual Reinforcement Learning

We revisit residual algorithms in both model-free and model-based reinfo...
research
12/11/2022

Off-Policy Deep Reinforcement Learning Algorithms for Handling Various Robotic Manipulator Tasks

In order to avoid conventional controlling methods which created obstacl...
research
09/20/2019

How Much Do Unstated Problem Constraints Limit Deep Robotic Reinforcement Learning?

Deep Reinforcement Learning is a promising paradigm for robotic control ...