A Bayesian Approach to Robust Inverse Reinforcement Learning

09/15/2023
by Ran Wei, et al.

We consider a Bayesian approach to offline model-based inverse reinforcement learning (IRL). The proposed framework differs from existing offline model-based IRL approaches by simultaneously estimating the expert's reward function and the expert's subjective model of the environment dynamics. Using a class of prior distributions that parameterizes how accurate the expert's model of the environment is, we develop efficient algorithms for estimating the expert's reward and subjective dynamics in high-dimensional settings. Our analysis reveals that the estimated policy exhibits robust performance when the expert is believed (a priori) to have a highly accurate model of the environment. We verify this observation in the MuJoCo environments and show that our algorithms outperform state-of-the-art offline IRL algorithms.
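
One way to picture the joint estimation described above is as a posterior over reward and dynamics parameters. The following is only a sketch under stated assumptions, not the formulation given in the full text: the symbols $\theta_R$, $\theta_T$, $\lambda$, $\mathcal{D}$, $\pi_\theta$ and $T_{\theta_T}$ are all introduced here for illustration, the expert is assumed to act (soft-)optimally with respect to its own subjective dynamics, and the prior is assumed to reward dynamics that explain the observed transitions.

```latex
% Assumed notation (not from the abstract):
%   \theta = (\theta_R, \theta_T)  reward and subjective-dynamics parameters
%   \mathcal{D} = \{(s_t, a_t, s_{t+1})\}_{t=1}^{N}  offline expert data
%   \pi_\theta      expert policy, optimal for reward R_{\theta_R} under dynamics T_{\theta_T}
%   \lambda \ge 0   prior strength: how accurate the expert's model is believed to be
\begin{align}
  P(\theta \mid \mathcal{D})
    &\propto P(\theta) \prod_{t=1}^{N} \pi_\theta(a_t \mid s_t), \\
  P(\theta)
    &\propto \exp\Big( \lambda \sum_{t=1}^{N} \log T_{\theta_T}(s_{t+1} \mid s_t, a_t) \Big), \\
  \log P(\theta \mid \mathcal{D})
    &= \sum_{t=1}^{N} \log \pi_\theta(a_t \mid s_t)
     + \lambda \sum_{t=1}^{N} \log T_{\theta_T}(s_{t+1} \mid s_t, a_t)
     + \text{const}.
\end{align}
```

In this sketch, a large $\lambda$ corresponds to the prior belief that the expert's model closely matches the true environment, which is the regime the abstract associates with robust performance of the estimated policy; $\lambda = 0$ would leave the subjective dynamics constrained only through the expert's observed actions.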

