Self-Supervised Disentangled Representation Learning for Third-Person Imitation Learning

08/02/2021
by   Jinghuan Shang, et al.
0

Humans learn to imitate by observing others. However, robot imitation learning generally requires expert demonstrations in the first-person view (FPV). Collecting such FPV videos for every robot could be very expensive. Third-person imitation learning (TPIL) is the concept of learning action policies by observing other agents in a third-person view (TPV), similar to what humans do. This ultimately allows utilizing human and robot demonstration videos in TPV from many different data sources, for the policy learning. In this paper, we present a TPIL approach for robot tasks with egomotion. Although many robot tasks with ground/aerial mobility often involve actions with camera egomotion, study on TPIL for such tasks has been limited. Here, FPV and TPV observations are visually very different; FPV shows egomotion while the agent appearance is only observable in TPV. To enable better state learning for TPIL, we propose our disentangled representation learning method. We use a dual auto-encoder structure plus representation permutation loss and time-contrastive loss to ensure the state and viewpoint representations are well disentangled. Our experiments show the effectiveness of our approach.

READ FULL TEXT

page 1

page 3

page 6

research
07/11/2017

Imitation from Observation: Learning to Imitate Behaviors from Raw Video via Context Translation

Imitation learning is an effective approach for autonomous systems to ac...
research
03/14/2023

Sample-efficient Adversarial Imitation Learning

Imitation learning, in which learning is performed by demonstration, has...
research
02/24/2023

Language-Driven Representation Learning for Robotics

Recent work in visual representation learning for robotics demonstrates ...
research
12/02/2021

The Surprising Effectiveness of Representation Learning for Visual Imitation

While visual imitation learning offers one of the most effective ways of...
research
05/16/2022

An Empirical Investigation of Representation Learning for Imitation

Imitation learning often needs a large demonstration set in order to han...
research
10/08/2019

Model-based Behavioral Cloning with Future Image Similarity Learning

We present a visual imitation learning framework that enables learning o...
research
10/16/2020

On the Guaranteed Almost Equivalence between Imitation Learning from Observation and Demonstration

Imitation learning from observation (LfO) is more preferable than imitat...

Please sign up or login with your details

Forgot password? Click here to reset