Bayesian multitask inverse reinforcement learning

06/18/2011
by   Christos Dimitrakakis, et al.
0

We generalise the problem of inverse reinforcement learning to multiple tasks, from multiple demonstrations. Each one may represent one expert trying to solve a different task, or as different experts trying to solve the same task. Our main contribution is to formalise the problem as statistical preference elicitation, via a number of structured priors, whose form captures our biases about the relatedness of different tasks or expert policies. In doing so, we introduce a prior on policy optimality, which is more natural to specify. We show that our framework allows us not only to learn to efficiently from multiple experts but to also effectively differentiate between the goals of each. Possible applications include analysing the intrinsic motivations of subjects in behavioural experiments and learning from multiple teachers.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/09/2018

Addressing Sample Inefficiency and Reward Bias in Inverse Reinforcement Learning

The Generative Adversarial Imitation Learning (GAIL) framework from Ho &...
research
06/10/2020

Bayesian Experience Reuse for Learning from Multiple Demonstrators

Learning from demonstrations (LfD) improves the exploration efficiency o...
research
01/11/2021

Action Priors for Large Action Spaces in Robotics

In robotics, it is often not possible to learn useful policies using pur...
research
03/23/2021

Meta-Adversarial Inverse Reinforcement Learning for Decision-making Tasks

Learning from demonstrations has made great progress over the past few y...
research
07/01/2020

Policy Improvement from Multiple Experts

Despite its promise, reinforcement learning's real-world adoption has be...
research
11/01/2019

Situated GAIL: Multitask imitation using task-conditioned adversarial inverse reinforcement learning

Generative adversarial imitation learning (GAIL) has attracted increasin...
research
03/01/2018

Inverse Reinforcement Learning via Nonparametric Spatio-Temporal Subgoal Modeling

Recent advances in the field of inverse reinforcement learning (IRL) hav...

Please sign up or login with your details

Forgot password? Click here to reset