An Imitation from Observation Approach to Sim-to-Real Transfer

08/04/2020
by   Siddarth Desai, et al.
5

The sim to real transfer problem deals with leveraging large amounts of inexpensive simulation experience to help artificial agents learn behaviors intended for the real world more efficiently. One approach to sim-to-real transfer is using interactions with the real world to make the simulator more realistic, called grounded sim to-real transfer. In this paper, we show that a particular grounded sim-to-real approach, grounded action transformation, is closely related to the problem of imitation from observation IfO, learning behaviors that mimic the observations of behavior demonstrations. After establishing this relationship, we hypothesize that recent state-of-the-art approaches from the IfO literature can be effectively repurposed for such grounded sim-to-real transfer. To validate our hypothesis we derive a new sim-to-real transfer algorithm - generative adversarial reinforced action transformation (GARAT) - based on adversarial imitation from observation techniques. We run experiments in several simulation domains with mismatched dynamics, and find that agents trained with GARAT achieve higher returns in the real world compared to existing black-box sim-to-real methods

READ FULL TEXT
research
08/04/2020

Reinforced Grounded Action Transformation for Sim-to-Real Transfer

Robots can learn to do complex tasks in simulation, but often, learned b...
research
07/17/2018

Generative Adversarial Imitation from Observation

Imitation from observation (IfO) is the problem of learning directly fro...
research
08/04/2020

Stochastic Grounded Action Transformation for Robot Learning in Simulation

Robot control policies learned in simulation do not often transfer well ...
research
02/01/2022

Adversarial Imitation Learning from Video using a State Observer

The imitation learning research community has recently made significant ...
research
04/28/2020

Augmented Behavioral Cloning from Observation

Imitation from observation is a computational technique that teaches an ...
research
02/23/2023

K-SHAP: Policy Clustering Algorithm for Anonymous State-Action Pairs

Learning agent behaviors from observational data has shown to improve ou...

Please sign up or login with your details

Forgot password? Click here to reset