Stabilizing Off-Policy Reinforcement Learning with Conservative Policy Gradients

10/02/2019
by   Chen Tessler, et al.
0

In recent years, advances in deep learning have enabled the application of reinforcement learning algorithms in complex domains. However, they lack the theoretical guarantees which are present in the tabular setting and suffer from many stability and reproducibility problems <cit.>. In this work, we suggest a simple approach for improving stability and providing probabilistic performance guarantees in off-policy actor-critic deep reinforcement learning regimes. Experiments on continuous action spaces, in the MuJoCo control suite, show that our proposed method reduces the variance of the process and improves the overall performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/31/2018

Pretraining Deep Actor-Critic Reinforcement Learning Algorithms With Expert Demonstrations

Pretraining with expert demonstrations have been found useful in speedin...
research
07/01/2019

FiDi-RL: Incorporating Deep Reinforcement Learning with Finite-Difference Policy Search for Efficient Learning of Continuous Control

In recent years significant progress has been made in dealing with chall...
research
07/30/2019

Control of nonlinear, complex and black-boxed greenhouse system with reinforcement learning

Modern control theories such as systems engineering approaches try to so...
research
02/25/2020

Off-Policy Deep Reinforcement Learning with Analogous Disentangled Exploration

Off-policy reinforcement learning (RL) is concerned with learning a rewa...
research
05/25/2020

Gradient Monitored Reinforcement Learning

This paper presents a novel neural network training approach for faster ...
research
03/14/2019

Deep Reinforcement Learning with Feedback-based Exploration

Deep Reinforcement Learning has enabled the control of increasingly comp...
research
08/04/2021

Risk Conditioned Neural Motion Planning

Risk-bounded motion planning is an important yet difficult problem for s...

Please sign up or login with your details

Forgot password? Click here to reset