Imitation by Predicting Observations

07/08/2021
by   Andrew Jaegle, et al.
0

Imitation learning enables agents to reuse and adapt the hard-won expertise of others, offering a solution to several key challenges in learning behavior. Although it is easy to observe behavior in the real-world, the underlying actions may not be accessible. We present a new method for imitation solely from observations that achieves comparable performance to experts on challenging continuous control tasks while also exhibiting robustness in the presence of observations unrelated to the task. Our method, which we call FORM (for "Future Observation Reward Model") is derived from an inverse RL objective and imitates using a model of expert behavior learned by generative modelling of the expert's observations, without needing ground truth actions. We show that FORM performs comparably to a strong baseline IRL method (GAIL) on the DeepMind Control Suite benchmark, while outperforming GAIL in the presence of task-irrelevant features.

READ FULL TEXT
research
06/19/2023

SeMAIL: Eliminating Distractors in Visual Imitation via Separated Models

Model-based imitation learning (MBIL) is a popular reinforcement learnin...
research
05/21/2018

Imitating Latent Policies from Observation

We describe a novel approach to imitation learning that infers latent po...
research
10/02/2019

Task-Relevant Adversarial Imitation Learning

We show that a critical problem in adversarial imitation from high-dimen...
research
05/04/2018

Behavioral Cloning from Observation

Humans often learn how to perform tasks via imitation: they observe othe...
research
02/02/2022

Imitation Learning by Estimating Expertise of Demonstrators

Many existing imitation learning datasets are collected from multiple de...
research
07/29/2023

Initial State Interventions for Deconfounded Imitation Learning

Imitation learning suffers from causal confusion. This phenomenon occurs...
research
05/23/2019

Teleoperator Imitation with Continuous-time Safety

Learning to effectively imitate human teleoperators, with generalization...

Please sign up or login with your details

Forgot password? Click here to reset