Adversarial Imitation Learning from Video using a State Observer

02/01/2022
by   Haresh Karnan, et al.
2

The imitation learning research community has recently made significant progress towards the goal of enabling artificial agents to imitate behaviors from video demonstrations alone. However, current state-of-the-art approaches developed for this problem exhibit high sample complexity due, in part, to the high-dimensional nature of video observations. Towards addressing this issue, we introduce here a new algorithm called Visual Generative Adversarial Imitation from Observation using a State Observer VGAIfO-SO. At its core, VGAIfO-SO seeks to address sample inefficiency using a novel, self-supervised state observer, which provides estimates of lower-dimensional proprioceptive state representations from high-dimensional images. We show experimentally in several continuous control environments that VGAIfO-SO is more sample efficient than other IfO algorithms at learning from video-only demonstrations and can sometimes even achieve performance close to the Generative Adversarial Imitation from Observation (GAIfO) algorithm that has privileged access to the demonstrator's proprioceptive state information.

READ FULL TEXT
research
07/17/2018

Generative Adversarial Imitation from Observation

Imitation from observation (IfO) is the problem of learning directly fro...
research
05/22/2019

Imitation Learning from Video by Leveraging Proprioception

Classically, imitation learning algorithms have been developed for ideal...
research
06/01/2021

What Matters for Adversarial Imitation Learning?

Adversarial imitation learning has become a popular framework for imitat...
research
12/18/2019

Relational Mimic for Visual Adversarial Imitation Learning

In this work, we introduce a new method for imitation learning from vide...
research
06/18/2019

Sample-efficient Adversarial Imitation Learning from Observation

Imitation from observation is the framework of learning tasks by observi...
research
07/16/2021

Visual Adversarial Imitation Learning using Variational Models

Reward function specification, which requires considerable human effort ...
research
08/04/2020

An Imitation from Observation Approach to Sim-to-Real Transfer

The sim to real transfer problem deals with leveraging large amounts of ...

Please sign up or login with your details

Forgot password? Click here to reset