Stochastic Inverse Reinforcement Learning

05/21/2019
by Ce Ju, et al.

Inverse reinforcement learning (IRL) is an ill-posed inverse problem: expert demonstrations may be consistent with many reward functions, and these solutions are hard to recover with local search methods such as gradient methods. In this paper, we generalize the original IRL problem to recover a probability distribution over reward functions. We call this generalized problem stochastic inverse reinforcement learning (SIRL), which we first formulate as an expectation optimization problem. We adopt the Monte Carlo expectation-maximization (MCEM) method, a global search method, to estimate the parameters of the probability distribution as the first solution to SIRL. With our approach, it is possible to observe the intrinsic properties of IRL from a global viewpoint, and the technique achieves considerably robust recovery performance on the classic learning environment objectworld.
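To make the MCEM idea concrete, the following is a minimal toy sketch, not the authors' implementation. It assumes a one-step decision problem with three actions, a linear reward (features times weights), a softmax expert policy, and an independent Gaussian distribution over the reward weights. The names `log_likelihood`, `mcem`, `features`, and `counts`, and all numbers, are hypothetical and chosen only for illustration. The E-step is approximated by sampling weights from the current distribution and reweighting them by the demonstration likelihood; the M-step refits the Gaussian to the weighted samples.

```python
import math
import random


def log_likelihood(w, features, counts):
    """Log-likelihood of expert action counts under a softmax policy
    whose per-action reward is the dot product of its features and w."""
    rewards = [sum(wi * fi for wi, fi in zip(w, f)) for f in features]
    top = max(rewards)  # subtract the max for numerical stability
    log_z = top + math.log(sum(math.exp(r - top) for r in rewards))
    return sum(c * (r - log_z) for c, r in zip(counts, rewards))


def mcem(features, counts, dim, iters=30, n_samples=500, seed=0):
    """Toy MCEM: estimate mean and std of a Gaussian over reward weights."""
    rng = random.Random(seed)
    mu, sigma = [0.0] * dim, [1.0] * dim
    for _ in range(iters):
        # Monte Carlo E-step: draw weights from the current Gaussian and
        # weight each draw by the likelihood of the demonstrations
        # (self-normalized importance sampling of the posterior).
        samples = [[rng.gauss(mu[d], sigma[d]) for d in range(dim)]
                   for _ in range(n_samples)]
        log_w = [log_likelihood(w, features, counts) for w in samples]
        top = max(log_w)
        weights = [math.exp(lw - top) for lw in log_w]
        total = sum(weights)
        weights = [x / total for x in weights]
        # M-step: weighted maximum-likelihood fit of the Gaussian.
        mu = [sum(p * s[d] for p, s in zip(weights, samples))
              for d in range(dim)]
        var = [sum(p * (s[d] - mu[d]) ** 2 for p, s in zip(weights, samples))
               for d in range(dim)]
        # Assumed floor on the std so the distribution cannot collapse.
        sigma = [max(math.sqrt(v), 1e-3) for v in var]
    return mu, sigma


# Hypothetical data: three actions with 2-d features, and how often
# the expert chose each action.
features = [[1.0, 0.0], [0.0, 1.0], [0.0, 0.0]]
counts = [30, 10, 5]
mu, sigma = mcem(features, counts, dim=2)
print("mean reward weights:", mu)
print("std of reward weights:", sigma)
```

In this sketch the action chosen most often should end up with the largest mean reward weight. Because a single fixed set of demonstrations is used, the estimated standard deviations tend to shrink toward the floor; that is a property of this toy setup, not a claim about SIRL.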

research
02/20/2020

oIRL: Robust Adversarial Inverse Reinforcement Learning with Temporally Extended Actions

Explicit engineering of reward functions for given environments has been...
research
12/04/2020

Demonstration-efficient Inverse Reinforcement Learning in Procedurally Generated Environments

Deep Reinforcement Learning achieves very good results in domains where ...
research
05/14/2023

Inverse Reinforcement Learning With Constraint Recovery

In this work, we propose a novel inverse reinforcement learning (IRL) al...
research
11/15/2021

Versatile Inverse Reinforcement Learning via Cumulative Rewards

Inverse Reinforcement Learning infers a reward function from expert demo...
research
05/29/2018

Variational Inverse Control with Events: A General Framework for Data-Driven Reward Definition

The design of a reward function often poses a major practical challenge ...
research
05/04/2020

Setting up experimental Bell test with reinforcement learning

Finding optical setups producing measurement results with a targeted pro...
research
02/20/2023

Multiagent Inverse Reinforcement Learning via Theory of Mind Reasoning

We approach the problem of understanding how people interact with each o...