Cross-Domain Imitation Learning via Optimal Transport

10/07/2021
by Arnaud Fickinger, et al.

Cross-domain imitation learning studies how to leverage expert demonstrations from one agent to train an imitation agent with a different embodiment or morphology. Comparing trajectories and stationary distributions between the expert and imitation agents is challenging because the agents live on different systems that may not even have the same dimensionality. We propose Gromov-Wasserstein Imitation Learning (GWIL), a method for cross-domain imitation that uses the Gromov-Wasserstein distance to align and compare states between the different spaces of the agents. Our theory formally characterizes the scenarios in which GWIL preserves optimality, revealing its possibilities and limitations. We demonstrate the effectiveness of GWIL in non-trivial continuous control domains, ranging from simple rigid transformations of the expert domain to arbitrary transformations of the state-action space.
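To make the central idea concrete, the following is a minimal sketch of how a Gromov-Wasserstein coupling can compare two point clouds that live in spaces of different dimension, using only pairwise distances within each space. It is not the authors' implementation and does not cover how GWIL turns the coupling into a learning signal. The entropic solver, the Euclidean metrics, the uniform weights, the regularization value `eps`, and all function and variable names are assumptions chosen for illustration.

```python
import numpy as np

def pairwise_dist(x):
    """Euclidean distance matrix within one space, scaled to [0, 1]."""
    diff = x[:, None, :] - x[None, :, :]
    d = np.sqrt((diff ** 2).sum(-1))
    return d / d.max()

def sinkhorn(p, q, cost, eps, n_iter=500):
    """Entropic optimal transport between weights p and q for a given cost."""
    K = np.exp(-cost / eps)
    v = np.ones_like(q)
    for _ in range(n_iter):
        u = p / (K @ v)
        v = q / (K.T @ u)
    return u[:, None] * K * v[None, :]

def entropic_gw(C1, C2, p, q, eps=0.2, n_outer=20):
    """Entropic Gromov-Wasserstein with squared loss (illustrative solver).

    Alternates between linearizing the quadratic GW objective around the
    current coupling T and solving the resulting entropic OT problem.
    """
    T = np.outer(p, q)  # start from the independent coupling
    # Part of the squared loss that does not depend on T when its
    # marginals are p and q.
    const = (C1 ** 2 @ p)[:, None] + (C2 ** 2 @ q)[None, :]
    for _ in range(n_outer):
        local_cost = const - 2.0 * C1 @ T @ C2.T
        T = sinkhorn(p, q, local_cost, eps)
    gw_cost = float(((const - 2.0 * C1 @ T @ C2.T) * T).sum())
    return T, gw_cost

# Hypothetical data: "expert" states in R^3, "imitator" states in R^2.
rng = np.random.default_rng(0)
expert_states = rng.normal(size=(30, 3))
imitator_states = rng.normal(size=(25, 2))

p = np.full(30, 1.0 / 30)  # uniform weights over expert states
q = np.full(25, 1.0 / 25)  # uniform weights over imitator states

T, gw_cost = entropic_gw(
    pairwise_dist(expert_states), pairwise_dist(imitator_states), p, q
)
```

Here `T[i, j]` is the mass matched between expert state `i` and imitator state `j`, and `gw_cost` measures how much the matched pairs distort within-space distances. Because only distance matrices enter the computation, the two state spaces never need a common coordinate system. A smaller `eps` gives a sharper coupling but needs more Sinkhorn iterations to converge.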

research
09/24/2022

Learn what matters: cross-domain imitation learning with task-relevant embeddings

We study how an autonomous agent learns to perform a task from demonstra...
research
05/20/2021

Cross-domain Imitation from Observations

Imitation learning seeks to circumvent the difficulty in designing prope...
research
06/02/2020

Cross-Domain Imitation Learning with a Dual Structure

In this paper, we consider cross-domain imitation learning (CDIL) in whi...
research
08/20/2020

Imitation Learning with Sinkhorn Distances

Imitation learning algorithms have been interpreted as variants of diver...
research
09/13/2021

Cross Domain Robot Imitation with Invariant Representation

Animals are able to imitate each others' behavior, despite their differe...
research
10/02/2019

Scenario Generalization of Data-driven Imitation Models in Crowd Simulation

Crowd simulation, the study of the movement of multiple agents in comple...
research
09/30/2019

Cross Domain Imitation Learning

We study the question of how to imitate tasks across domains with discre...