Distributional Reinforcement Learning with Dual Expectile-Quantile Regression

05/26/2023
by   Sami Jullien, et al.
0

Successful applications of distributional reinforcement learning with quantile regression prompt a natural question: can we use other statistics to represent the distribution of returns? In particular, expectile regression is known to be more efficient than quantile regression for approximating distributions, especially on extreme values, and by providing a straightforward estimator of the mean it is a natural candidate for reinforcement learning. Prior work has answered this question positively in the case of expectiles, with the major caveat that expensive computations must be performed to ensure convergence. In this work, we propose a dual expectile-quantile approach which solves the shortcomings of previous work while leveraging the complementary properties of expectiles and quantiles. Our method outperforms both quantile-based and expectile-based baselines on the MuJoCo continuous control benchmark.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/14/2018

Implicit Quantile Networks for Distributional Reinforcement Learning

In this work, we build on recent advances in distributional reinforcemen...
research
10/01/2021

A Cramér Distance perspective on Non-crossing Quantile Regression in Distributional Reinforcement Learning

Distributional reinforcement learning (DRL) extends the value-based appr...
research
12/27/2022

Quantile Risk Control: A Flexible Framework for Bounding the Probability of High-Loss Predictions

Rigorous guarantees about the performance of predictive algorithms are n...
research
04/19/2018

Multiple factor analysis of distributional data

In the framework of Symbolic Data Analysis (SDA), distribution-variables...
research
05/06/2023

Twin support vector quantile regression

We propose a twin support vector quantile regression (TSVQR) to capture ...
research
11/05/2018

QUOTA: The Quantile Option Architecture for Reinforcement Learning

In this paper, we propose the Quantile Option Architecture (QUOTA) for e...
research
05/08/2020

Controlling Overestimation Bias with Truncated Mixture of Continuous Distributional Quantile Critics

The overestimation bias is one of the major impediments to accurate off-...

Please sign up or login with your details

Forgot password? Click here to reset