Probabilistic inverse reinforcement learning in unknown environments

08/09/2014
by Aristide Tossou, et al.

We consider the problem of learning by demonstration from agents acting in unknown stochastic Markov environments or games. Our aim is to estimate agent preferences in order to construct improved policies for the same task that the agents are trying to solve. To do so, we extend previous probabilistic approaches for inverse reinforcement learning in known MDPs to the case of unknown dynamics or opponents. We do this by deriving two simplified probabilistic models of the demonstrator's policy and utility. For tractability, we use maximum a posteriori estimation rather than full Bayesian inference. Under a flat prior, this results in a convex optimisation problem. We find that the resulting algorithms are highly competitive against a variety of other methods for inverse reinforcement learning that do have knowledge of the dynamics.
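
As a rough illustration of why a flat prior leads to a convex problem, the sketch below fits a softmax policy to observed (state, action) pairs without using any transition model. It is not the paper's model: the tabular softmax parametrisation, the function name `fit_softmax_policy`, and the step-size and iteration defaults are assumptions made for this example. Under a flat prior the MAP estimate coincides with the maximum-likelihood estimate, and the negative log-likelihood of a softmax policy is convex in its logits, so plain gradient descent is enough.

```python
import numpy as np


def softmax_rows(theta):
    """Row-wise softmax with the usual max-subtraction for stability."""
    shifted = theta - theta.max(axis=1, keepdims=True)
    expd = np.exp(shifted)
    return expd / expd.sum(axis=1, keepdims=True)


def fit_softmax_policy(demos, n_states, n_actions, lr=0.5, n_iters=500):
    """Fit per-state action logits to demonstrations by gradient descent.

    demos is a sequence of (state, action) index pairs (hypothetical format).
    With a flat prior, the MAP objective is just the negative log-likelihood,
    which is convex in theta for a softmax policy.
    """
    theta = np.zeros((n_states, n_actions))

    # Sufficient statistics: how often each action was taken in each state.
    counts = np.zeros((n_states, n_actions))
    for s, a in demos:
        counts[s, a] += 1.0
    visits = counts.sum(axis=1, keepdims=True)
    n = max(len(demos), 1)

    for _ in range(n_iters):
        policy = softmax_rows(theta)
        # Gradient of the mean negative log-likelihood w.r.t. the logits.
        grad = (visits * policy - counts) / n
        theta -= lr * grad

    return theta, softmax_rows(theta)


if __name__ == "__main__":
    # Toy demonstrations: state 0 mostly picks action 0; state 1 picks action 1.
    demos = [(0, 0)] * 8 + [(0, 1)] * 2 + [(1, 1)] * 5
    theta, policy = fit_softmax_policy(demos, n_states=3, n_actions=2)
    print(policy)
```

Only the demonstrated actions enter the objective, so no knowledge of the environment dynamics is needed for this step. States that never appear in the demonstrations keep a uniform policy, since their gradient is zero.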

research
04/29/2011

Preference elicitation and inverse reinforcement learning

We state the problem of inverse reinforcement learning in terms of prefe...
research
05/21/2018

A Framework and Method for Online Inverse Reinforcement Learning

Inverse reinforcement learning (IRL) is the problem of learning the pref...
research
12/13/2017

Inverse Reinforcement Learning for Marketing

Learning customer preferences from an observed behaviour is an important...
research
03/25/2014

Multi-agent Inverse Reinforcement Learning for Zero-sum Games

In this paper we introduce a Bayesian framework for solving a class of p...
research
11/29/2021

Dynamic Inference

Traditional statistical estimation, or statistical inference in general,...
research
06/20/2020

Langevin Dynamics for Inverse Reinforcement Learning of Stochastic Gradient Algorithms

Inverse reinforcement learning (IRL) aims to estimate the reward functio...
research
07/31/2021

Inverse Reinforcement Learning for Strategy Identification

In adversarial environments, one side could gain an advantage by identif...
