Meta Inverse Reinforcement Learning via Maximum Reward Sharing for Human Motion Analysis

10/07/2017
by Kun Li, et al.

This work addresses the inverse reinforcement learning (IRL) problem in which each high-dimensional task comes with only a small number of demonstrations from a demonstrator, too few to estimate an accurate reward function. Observing that each demonstrator has an inherent reward for each state, and that task-specific behaviors depend mainly on a small number of key states, we propose a meta IRL algorithm. It first models the reward function for each task as a distribution conditioned on a baseline reward function that is shared by all tasks and depends only on the demonstrator, and then finds the most likely reward function in that distribution that explains the task-specific behaviors. We test the method in a simulated environment on path planning tasks with limited demonstrations and show that the accuracy of the learned reward function is significantly improved. We also apply the method to analyze the motion of a patient undergoing rehabilitation.
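
To make the reward-sharing idea concrete, here is a minimal toy sketch, not the authors' algorithm. It assumes a drastically simplified, bandit-like setting in which the demonstrator visits state s with probability proportional to exp(reward[s]), so no MDP needs to be solved. It also assumes the "small number of key states" can be encoded as an L1 (Laplace-prior) penalty on each task's deviation from the shared baseline. The names `fit_shared_rewards`, `lam`, `lr`, and `steps`, and the demonstration counts, are hypothetical and chosen only for illustration.

```python
import numpy as np

def softmax_rows(x):
    # Row-wise softmax, shifted by the row maximum for numerical stability.
    z = np.exp(x - x.max(axis=1, keepdims=True))
    return z / z.sum(axis=1, keepdims=True)

def soft_threshold(x, t):
    # Proximal operator of t * ||x||_1: shrinks every entry toward zero by t.
    return np.sign(x) * np.maximum(np.abs(x) - t, 0.0)

def fit_shared_rewards(counts, lam=0.05, lr=0.2, steps=3000):
    """MAP-style fit of task reward = shared baseline + sparse deviation.

    counts: (num_tasks, num_states) array of demonstrated state-visit counts;
            every task is assumed to have at least one demonstration.
    Returns (baseline, deviations).
    """
    counts = np.asarray(counts, dtype=float)
    num_tasks, num_states = counts.shape
    baseline = np.zeros(num_states)
    dev = np.zeros((num_tasks, num_states))
    totals = counts.sum(axis=1, keepdims=True)
    for _ in range(steps):
        probs = softmax_rows(baseline + dev)
        # Gradient of each task's mean log-likelihood w.r.t. its rewards:
        # empirical visit frequency minus model visit probability.
        grad = counts / totals - probs
        # The baseline is shared, so it follows the gradient averaged over tasks.
        baseline = baseline + lr * grad.mean(axis=0)
        # Proximal gradient step keeps the task-specific deviations sparse.
        dev = soft_threshold(dev + lr * grad, lr * lam)
    return baseline, dev

# Hypothetical data: three tasks, four states, twenty demonstrated visits each.
# Each task favors a different key state; all tasks rarely visit the last state.
counts = np.array([
    [12, 3, 3, 2],
    [3, 12, 3, 2],
    [3, 3, 12, 2],
])
baseline, dev = fit_shared_rewards(counts)
task_rewards = baseline + dev
print(np.round(baseline, 2))
print(np.round(dev, 2))
```

In this sketch the baseline absorbs what the tasks have in common, while the L1 penalty pushes each deviation toward zero except where a task's demonstrations clearly disagree with the shared reward. A faithful implementation would replace the softmax-over-states likelihood with an IRL likelihood defined over trajectories in the actual environment.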
