Inverse Reinforcement Learning in Contextual MDPs

05/23/2019
by   Philip Korsunsky, et al.
0

We consider the Inverse Reinforcement Learning (IRL) problem in Contextual Markov Decision Processes (CMDPs). Here, the reward of the environment, which is not available to the agent, depends on a static parameter referred to as the context. Each context defines an MDP (with a different reward signal), and the agent is provided demonstrations by an expert, for different contexts. The goal is to learn a mapping from contexts to rewards, such that planning with respect to the induced reward will perform similarly to the expert, even for unseen contexts. We suggest two learning algorithms for this scenario. (1) For rewards that are a linear function of the context, we provide a method that is guaranteed to return an ϵ-optimal solution after a polynomial number of demonstrations. (2) For general reward functions, we propose black-box descent methods based on evolutionary strategies capable of working with nonlinear estimators (e.g., neural networks). We evaluate our algorithms in autonomous driving and medical treatment simulations and demonstrate their ability to learn and generalize to unseen contexts.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/28/2023

BC-IRL: Learning Generalizable Reward Functions from Demonstrations

How well do reward functions learned with inverse reinforcement learning...
research
02/08/2015

Contextual Markov Decision Processes

We consider a planning problem where the dynamics and rewards of the env...
research
06/11/2019

Towards Inverse Reinforcement Learning for Limit Order Book Dynamics

Multi-agent learning is a promising method to simulate aggregate competi...
research
02/25/2022

Context-Hierarchy Inverse Reinforcement Learning

An inverse reinforcement learning (IRL) agent learns to act intelligentl...
research
06/03/2016

Difference of Convex Functions Programming Applied to Control with Expert Data

This paper reports applications of Difference of Convex functions (DC) p...
research
06/20/2022

Benchmarking Constraint Inference in Inverse Reinforcement Learning

When deploying Reinforcement Learning (RL) agents into a physical system...
research
07/14/2021

Deep Adaptive Multi-Intention Inverse Reinforcement Learning

This paper presents a deep Inverse Reinforcement Learning (IRL) framewor...

Please sign up or login with your details

Forgot password? Click here to reset