A Comparative Analysis of Expected and Distributional Reinforcement Learning

01/30/2019
by   Clare Lyle, et al.
6

Since their introduction a year ago, distributional approaches to reinforcement learning (distributional RL) have produced strong results relative to the standard approach which models expected values (expected RL). However, aside from convergence guarantees, there have been few theoretical results investigating the reasons behind the improvements distributional RL provides. In this paper we begin the investigation into this fundamental question by analyzing the differences in the tabular, linear approximation, and non-linear approximation settings. We prove that in many realizations of the tabular and linear approximation settings, distributional RL behaves exactly the same as expected RL. In cases where the two methods behave differently, distributional RL can in fact hurt performance when it does not induce identical behaviour. We then continue with an empirical analysis comparing distributional and expected RL methods in control settings with non-linear approximators to tease apart where the improvements from distributional RL methods are coming from.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/15/2022

The Nature of Temporal Difference Errors in Multi-step Distributional Reinforcement Learning

We study the multi-step off-policy learning approach to distributional R...
research
02/22/2018

An Analysis of Categorical Distributional Reinforcement Learning

Distributional approaches to value-based reinforcement learning model th...
research
05/13/2018

GAN Q-learning

Distributional reinforcement learning (distributional RL) has seen empir...
research
05/20/2018

Nonlinear Distributional Gradient Temporal-Difference Learning

We devise a distributional variant of gradient temporal-difference (TD) ...
research
03/03/2022

On Practical Reinforcement Learning: Provable Robustness, Scalability, and Statistical Efficiency

This thesis rigorously studies fundamental reinforcement learning (RL) m...
research
02/08/2019

Distributional reinforcement learning with linear function approximation

Despite many algorithmic advances, our theoretical understanding of prac...
research
05/14/2022

Interpretable Stochastic Model Predictive Control using Distributional Reinforced Estimation for Quadrotor Tracking Systems

This paper presents a novel trajectory tracker for autonomous quadrotor ...

Please sign up or login with your details

Forgot password? Click here to reset