Risk-sensitive Inverse Reinforcement Learning via Semi- and Non-Parametric Methods

11/28/2017
by   Sumeet Singh, et al.
0

The literature on Inverse Reinforcement Learning (IRL) typically assumes that humans take actions in order to minimize the expected value of a cost function, i.e., that humans are risk neutral. Yet, in practice, humans are often far from being risk neutral. To fill this gap, the objective of this paper is to devise a framework for risk-sensitive IRL in order to explicitly account for a human's risk sensitivity. To this end, we propose a flexible class of models based on coherent risk measures, which allow us to capture an entire spectrum of risk preferences from risk-neutral to worst-case. We propose efficient non-parametric algorithms based on linear programming and semi-parametric algorithms based on maximum likelihood for inferring a human's underlying risk measure and cost function for a rich class of static and dynamic decision-making settings. The resulting approach is demonstrated on a simulated driving game with ten human participants. Our method is able to infer and mimic a wide range of qualitatively different driving styles from highly risk-averse to risk-neutral in a data-efficient manner. Moreover, comparisons of the Risk-Sensitive (RS) IRL approach with a risk-neutral model show that the RS-IRL framework more accurately captures observed participant behavior both qualitatively and quantitatively, especially in scenarios where catastrophic outcomes such as collisions can occur.

READ FULL TEXT

page 5

page 13

page 28

research
09/14/2019

Active Learning for Risk-Sensitive Inverse Reinforcement Learning

One typical assumption in inverse reinforcement learning (IRL) is that h...
research
05/26/2017

Risk-Sensitive Cooperative Games for Human-Machine Systems

Autonomous systems can substantially enhance a human's efficiency and ef...
research
10/11/2022

Regret Bounds for Risk-Sensitive Reinforcement Learning

In safety-critical applications of reinforcement learning such as health...
research
09/22/2022

Newsvendor Conditional Value-at-Risk Minimisation with a Non-Parametric Approach

In the classical Newsvendor problem, one must determine the order quanti...
research
06/14/2019

Epistemic Risk-Sensitive Reinforcement Learning

We develop a framework for interacting with uncertain environments in re...
research
01/13/2023

Risk Sensitive Dead-end Identification in Safety-Critical Offline Reinforcement Learning

In safety-critical decision-making scenarios being able to identify wors...
research
12/14/2018

Guaranteed satisficing and finite regret: Analysis of a cognitive satisficing value function

As reinforcement learning algorithms are being applied to increasingly c...

Please sign up or login with your details

Forgot password? Click here to reset