Variance Reduced Advantage Estimation with δ Hindsight Credit Assignment

11/19/2019
by   Kenny Young, et al.
0

Hindsight Credit Assignment (HCA) refers to a recently proposed family of methods for producing more efficient credit assignment in reinforcement learning. These methods work by explicitly estimating the probability that certain actions were taken in the past given present information. Prior work has studied the properties of such methods and demonstrated their behaviour empirically. We extend this work by introducing a particular HCA algorithm which has provably lower variance than the conventional Monte-Carlo estimator when the necessary functions can be estimated exactly. This result provides a strong theoretical basis for how HCA could be broadly useful.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/05/2019

Hindsight Credit Assignment

We consider the problem of efficient credit assignment in reinforcement ...
research
06/08/2021

Towards Practical Credit Assignment for Deep Reinforcement Learning

Credit assignment is a fundamental problem in reinforcement learning, th...
research
06/29/2023

Would I have gotten that reward? Long-term credit assignment by counterfactual contribution analysis

To make reinforcement learning more sample efficient, we need better cre...
research
01/07/2019

Credit Assignment Techniques in Stochastic Computation Graphs

Stochastic computation graphs (SCGs) provide a formalism to represent st...
research
03/07/2022

On Credit Assignment in Hierarchical Reinforcement Learning

Hierarchical Reinforcement Learning (HRL) has held longstanding promise ...
research
10/26/2020

Forethought and Hindsight in Credit Assignment

We address the problem of credit assignment in reinforcement learning an...
research
06/01/2022

Predecessor Features

Any reinforcement learning system must be able to identify which past ev...

Please sign up or login with your details

Forgot password? Click here to reset