Stochastic Approximation for Risk-aware Markov Decision Processes

05/11/2018
by   Wenjie Huang, et al.
0

In this paper, we develop a stochastic approximation type algorithm to solve finite state and action, infinite-horizon, risk-aware Markov decision processes. Our algorithm is based on solving stochastic saddle-point problems for risk estimation and doing Q-learning for finding the optimal risk-aware policy. We show that several widely investigated risk measures (e.g. conditional value-at-risk, optimized certainty equivalent, and absolute semi-deviation) can be expressed as such stochastic saddle-point problems. We establish the almost sure convergence and convergence rate results for our overall algorithm. For error tolerance ϵ and learning rate k, the convergence rate of our algorithm is Ω(((1/δϵ)/ϵ^2)^1/k+((1/ϵ))^1/(1-k)) with probability 1-δ.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/28/2023

Rate-Optimal Policy Optimization for Linear Markov Decision Processes

We study regret minimization in online episodic linear Markov Decision P...
research
09/07/2018

A Block Coordinate Ascent Algorithm for Mean-Variance Optimization

Risk management in dynamic decision problems is a primary concern in man...
research
01/30/2023

Regret Bounds for Markov Decision Processes with Recursive Optimized Certainty Equivalents

The optimized certainty equivalent (OCE) is a family of risk measures th...
research
10/16/2012

An Approximate Solution Method for Large Risk-Averse Markov Decision Processes

Stochastic domains often involve risk-averse decision makers. While rece...
research
10/19/2021

Planning for Package Deliveries in Risky Environments Over Multiple Epochs

We study a risk-aware robot planning problem where a dispatcher must con...
research
10/10/2016

Situational Awareness by Risk-Conscious Skills

Hierarchical Reinforcement Learning has been previously shown to speed u...
research
07/13/2023

Entropic Risk for Turn-Based Stochastic Games

Entropic risk (ERisk) is an established risk measure in finance, quantif...

Please sign up or login with your details

Forgot password? Click here to reset