Imitation Learning with Sinkhorn Distances

08/20/2020
by Georgios Papagiannis, et al.

Imitation learning algorithms have been interpreted as variants of divergence minimization problems. The ability to compare the occupancy measures of experts and learners is crucial to their effectiveness in learning from demonstrations. In this paper, we present tractable solutions by formulating imitation learning as minimization of the Sinkhorn distance between occupancy measures. The formulation combines the valuable properties of optimal transport metrics for comparing non-overlapping distributions with a cosine distance cost defined in an adversarially learned feature space. This leads to a highly discriminative critic network and an optimal transport plan that subsequently guide imitation learning. We evaluate the proposed approach using both the reward metric and the Sinkhorn distance metric on a number of MuJoCo experiments.
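
To make the central quantity concrete, here is a minimal NumPy sketch of a Sinkhorn distance between two sets of samples under a cosine distance cost. It is an illustration under stated assumptions, not the paper's implementation: the cost is computed directly on the raw input vectors rather than in an adversarially learned feature space, both sample sets carry uniform weights, and the names `cosine_cost`, `sinkhorn_distance`, `reg`, and `n_iters` are hypothetical choices made for this example.

```python
import numpy as np


def cosine_cost(x, y, eps=1e-8):
    """Pairwise cosine distance 1 - cos(x_i, y_j) between rows of x and y."""
    xn = x / (np.linalg.norm(x, axis=1, keepdims=True) + eps)
    yn = y / (np.linalg.norm(y, axis=1, keepdims=True) + eps)
    return 1.0 - xn @ yn.T


def sinkhorn_distance(x, y, reg=0.1, n_iters=200):
    """Entropy-regularized optimal transport cost between two sample sets.

    Assumptions (not taken from the paper): uniform weights on both sets,
    a fixed regularization strength `reg`, and a fixed iteration count.
    Returns the transport cost <P, C> and the transport plan P.
    """
    n, m = x.shape[0], y.shape[0]
    a = np.full(n, 1.0 / n)  # uniform weights on the first sample set
    b = np.full(m, 1.0 / m)  # uniform weights on the second sample set
    C = cosine_cost(x, y)    # cost matrix with entries in [0, 2]
    K = np.exp(-C / reg)     # Gibbs kernel
    u = np.ones(n)
    v = np.ones(m)
    for _ in range(n_iters):
        # Alternately rescale rows and columns to match the marginals.
        u = a / (K @ v)
        v = b / (K.T @ u)
    P = u[:, None] * K * v[None, :]  # transport plan
    return float(np.sum(P * C)), P


# Toy usage: "expert" samples cluster around one direction and "learner"
# samples around an orthogonal one (synthetic data, for illustration only).
rng = np.random.default_rng(0)
expert = np.array([1.0, 0.0, 0.0]) + 0.05 * rng.normal(size=(16, 3))
learner = np.array([0.0, 1.0, 0.0]) + 0.05 * rng.normal(size=(12, 3))

d_same, _ = sinkhorn_distance(expert, expert)
d_diff, plan = sinkhorn_distance(expert, learner)
print(d_same, d_diff)
```

In this toy setup the distance between the expert samples and themselves is expected to be much smaller than the distance between the two clusters. It is not exactly zero, because the entropic regularization spreads some mass off the diagonal of the plan. In an imitation learning setting, the rows of `x` and `y` would stand for state-action samples from the learner and the expert, and the plan `P` indicates how strongly each learner sample is matched to each expert sample.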

research
07/20/2023

On Combining Expert Demonstrations in Imitation Learning via Optimal Transport

Imitation learning (IL) seeks to teach agents specific tasks through exp...
research
10/07/2021

Cross-Domain Imitation Learning via Optimal Transport

Cross-domain imitation learning studies how to leverage expert demonstra...
research
04/15/2021

Skeletal Feature Compensation for Imitation Learning with Embodiment Mismatch

Learning from demonstrations in the wild (e.g. YouTube videos) is a tant...
research
06/19/2019

Wasserstein Adversarial Imitation Learning

Imitation Learning describes the problem of recovering an expert policy ...
research
07/20/2021

Critic Guided Segmentation of Rewarding Objects in First-Person Views

This work discusses a learning approach to mask rewarding objects in ima...
research
06/18/2020

Reparameterized Variational Divergence Minimization for Stable Imitation

While recent state-of-the-art results for adversarial imitation-learning...
research
06/24/2019

Generalized Multiple Correlation Coefficient as a Similarity Measurements between Trajectories

Similarity distance measure between two trajectories is an essential too...