Generative Adversarial Imitation from Observation

07/17/2018
by Faraz Torabi, et al.

Imitation from observation (IfO) is the problem of learning directly from state-only demonstrations, without access to the demonstrator's actions. The lack of action information both distinguishes IfO from most of the imitation learning literature and sets it apart as a method that may enable agents to learn from a large set of previously inapplicable resources, such as internet videos. In this paper, we propose both a general framework for IfO approaches and a new IfO approach based on generative adversarial networks, called generative adversarial imitation from observation (GAIfO). We demonstrate that this approach performs comparably to classical imitation learning approaches (which have access to the demonstrator's actions) and significantly outperforms existing imitation from observation methods in high-dimensional simulation environments.
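To make the adversarial, state-only idea concrete, here is a minimal sketch, not the authors' implementation. It assumes the discriminator scores state-transition pairs (s, s') rather than state-action pairs, and it swaps in a linear logistic model and synthetic 1-D data for the neural networks and simulated environments a real system would use. The function names, the label convention (1 = demonstrator), and the log-probability reward are all illustrative assumptions.

```python
import numpy as np

def sigmoid(z):
    # Numerically stable logistic function.
    return 0.5 * (1.0 + np.tanh(0.5 * z))

def train_discriminator(demo, agent, steps=2000, lr=1.0):
    """Fit a logistic discriminator on [s, s'] rows.

    Assumed label convention: 1 = demonstrator transition, 0 = imitator.
    """
    x = np.vstack([demo, agent])
    y = np.concatenate([np.ones(len(demo)), np.zeros(len(agent))])
    w, b = np.zeros(x.shape[1]), 0.0
    for _ in range(steps):
        err = sigmoid(x @ w + b) - y      # cross-entropy gradient w.r.t. logits
        w -= lr * (x.T @ err) / len(y)
        b -= lr * err.mean()
    return w, b

def imitation_reward(transitions, w, b):
    """Hypothetical reward: log-probability of looking demonstrator-like."""
    return np.log(sigmoid(transitions @ w + b) + 1e-8)

# Synthetic data (an assumption for illustration): no actions are recorded,
# only states. The demonstrator moves right, the imitator moves left.
rng = np.random.default_rng(0)
s = rng.uniform(-1.0, 1.0, size=(200, 1))
demo = np.hstack([s, s + 0.5])    # rows are [s, s']
agent = np.hstack([s, s - 0.5])

w, b = train_discriminator(demo, agent)
demo_reward = imitation_reward(demo, w, b).mean()
agent_reward = imitation_reward(agent, w, b).mean()
print(demo_reward, agent_reward)
```

In a full adversarial loop, the imitator's policy would be updated with a reinforcement learning step to increase this reward, and the discriminator would then be retrained on fresh imitator transitions. Only the discriminator half is sketched here.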


