RASR: Risk-Averse Soft-Robust MDPs with EVaR and Entropic Risk

09/09/2022
by   Jia Lin Hau, et al.
0

Prior work on safe Reinforcement Learning (RL) has studied risk-aversion to randomness in dynamics (aleatory) and to model uncertainty (epistemic) in isolation. We propose and analyze a new framework to jointly model the risk associated with epistemic and aleatory uncertainties in finite-horizon and discounted infinite-horizon MDPs. We call this framework that combines Risk-Averse and Soft-Robust methods RASR. We show that when the risk-aversion is defined using either EVaR or the entropic risk, the optimal policy in RASR can be computed efficiently using a new dynamic program formulation with a time-dependent risk level. As a result, the optimal risk-averse policies are deterministic but time-dependent, even in the infinite-horizon discounted setting. We also show that particular RASR objectives reduce to risk-averse RL with mean posterior transition probabilities. Our empirical results show that our new algorithms consistently mitigate uncertainty as measured by EVaR and other standard risk measures.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/10/2022

A policy gradient approach for Finite Horizon Constrained Markov Decision Processes

The infinite horizon setting is widely adopted for problems of reinforce...
research
01/14/2023

Risk-Averse Reinforcement Learning via Dynamic Time-Consistent Risk Measures

Traditional reinforcement learning (RL) aims to maximize the expected to...
research
08/16/2023

Eliciting Risk Aversion with Inverse Reinforcement Learning via Interactive Questioning

This paper proposes a novel framework for identifying an agent's risk av...
research
09/16/2021

Enabling risk-aware Reinforcement Learning for medical interventions through uncertainty decomposition

Reinforcement Learning (RL) is emerging as tool for tackling complex con...
research
05/18/2023

Bayesian Risk-Averse Q-Learning with Streaming Observations

We consider a robust reinforcement learning problem, where a learning ag...
research
11/04/2021

Infinite Time Horizon Safety of Bayesian Neural Networks

Bayesian neural networks (BNNs) place distributions over the weights of ...
research
02/03/2022

Challenging Common Assumptions in Convex Reinforcement Learning

The classic Reinforcement Learning (RL) formulation concerns the maximiz...

Please sign up or login with your details

Forgot password? Click here to reset