Lexicographic Optimisation of Conditional Value at Risk and Expected Value for Risk-Averse Planning in MDPs

10/25/2021
by   Marc Rigter, et al.
16

Planning in Markov decision processes (MDPs) typically optimises the expected cost. However, optimising the expectation does not consider the risk that for any given run of the MDP, the total cost received may be unacceptably high. An alternative approach is to find a policy which optimises a risk-averse objective such as conditional value at risk (CVaR). In this work, we begin by showing that there can be multiple policies which obtain the optimal CVaR. We formulate the lexicographic optimisation problem of minimising the expected cost subject to the constraint that the CVaR of the total cost is optimal. We present an algorithm for this problem and evaluate our approach on three domains, including a road navigation domain based on real traffic data. Our experimental results demonstrate that our lexicographic approach attains improved expected cost while maintaining the optimal CVaR.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/26/2021

Risk-Averse Stochastic Shortest Path Planning

We consider the stochastic shortest path planning problem in MDPs, i.e.,...
research
02/10/2021

Risk-Averse Bayes-Adaptive Reinforcement Learning

In this work, we address risk-averse Bayesadaptive reinforcement learnin...
research
06/06/2015

Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach

In this paper we address the problem of decision making within a Markov ...
research
01/14/2023

Risk-Averse Reinforcement Learning via Dynamic Time-Consistent Risk Measures

Traditional reinforcement learning (RL) aims to maximize the expected to...
research
05/03/2015

Metareasoning for Planning Under Uncertainty

The conventional model for online planning under uncertainty assumes tha...
research
09/11/2023

Distributional Probabilistic Model Checking

Probabilistic model checking can provide formal guarantees on the behavi...
research
01/10/2013

Robust Combination of Local Controllers

Planning problems are hard, motion planning, for example, isPSPACE-hard....

Please sign up or login with your details

Forgot password? Click here to reset