On solutions of the distributional Bellman equation

01/31/2022
by   Julian Gerstenberg, et al.
0

In distributional reinforcement learning not only expected returns but the complete return distributions of a policy is taken into account. The return distribution for a fixed policy is given as the fixed point of an associated distributional Bellman operator. In this note we consider general distributional Bellman operators and study existence and uniqueness of its fixed points as well as their tail properties. We give necessary and sufficient conditions for existence and uniqueness of return distributions and identify cases of regular variation. We link distributional Bellman equations to multivariate distributional equations of the form X =_d AX + B, where X and B are d-dimensional random vectors, A a random d× d matrix and X and (A,B) are independent. We show that any fixed-point of a distributional Bellman operator can be obtained as the vector of marginal laws of a solution to such a multivariate distributional equation. This makes the general theory of such equations applicable to the distributional reinforcement learning setting.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/24/2022

Distributional Hamilton-Jacobi-Bellman Equations for Continuous-Time Reinforcement Learning

Continuous-time reinforcement learning offers an appealing formalism for...
research
03/20/2021

Bayesian Distributional Policy Gradients

Distributional Reinforcement Learning (RL) maintains the entire probabil...
research
05/12/2019

The compound product distribution; a solution to the distributional equation X=AX+1

The solution of X=AX+1 is analyzed for a discrete variable A with P...
research
11/22/2020

Non-Identifiability in Network Autoregressions

We study identification in autoregressions defined on a general network....
research
08/06/2018

Distributional Multivariate Policy Evaluation and Exploration with the Bellman GAN

The recently proposed distributional approach to reinforcement learning ...
research
03/23/2023

Policy Evaluation in Distributional LQR

Distributional reinforcement learning (DRL) enhances the understanding o...

Please sign up or login with your details

Forgot password? Click here to reset