KF-LAX: Kronecker-factored curvature estimation for control variate optimization in reinforcement learning

12/11/2018
by   Mohammad Firouzi, et al.
0

A key challenge for gradient based optimization methods in model-free reinforcement learning is to develop an approach that is sample efficient and has low variance. In this work, we apply Kronecker-factored curvature estimation technique (KFAC) to a recently proposed gradient estimator for control variate optimization, RELAX, to increase the sample efficiency of using this gradient estimation method in reinforcement learning. The performance of the proposed method is demonstrated on a synthetic problem and a set of three discrete control task Atari games.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/17/2017

Stochastic Variance Reduction for Policy Gradient Estimation

Recent advances in policy gradient methods and deep learning have demons...
research
05/15/2018

Local Saddle Point Optimization: A Curvature Exploitation Approach

Gradient-based optimization methods are the most popular choice for find...
research
08/21/2020

Model-Free Episodic Control with State Aggregation

Episodic control provides a highly sample-efficient method for reinforce...
research
01/05/2022

Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation

In model-free deep reinforcement learning (RL) algorithms, using noisy v...
research
01/07/2019

Credit Assignment Techniques in Stochastic Computation Graphs

Stochastic computation graphs (SCGs) provide a formalism to represent st...
research
06/03/2019

Proximal Reliability Optimization for Reinforcement Learning

Despite the numerous advances, reinforcement learning remains away from ...
research
09/11/2021

Bundled Gradients through Contact via Randomized Smoothing

The empirical success of derivative-free methods in reinforcement learni...

Please sign up or login with your details

Forgot password? Click here to reset