On Combining Expert Demonstrations in Imitation Learning via Optimal Transport

07/20/2023
by Ilana Sebag, et al.

Imitation learning (IL) seeks to teach agents specific tasks through expert demonstrations. One of the key approaches to IL is to define a distance between agent and expert and to find an agent policy that minimizes that distance. Optimal transport (OT) methods have been widely used in imitation learning as they provide ways to measure meaningful distances between agent and expert trajectories. However, the problem of how to optimally combine multiple expert demonstrations has not been widely studied. The standard method is to simply concatenate state(-action) trajectories, which is problematic when trajectories are multi-modal. We propose an alternative method that uses a multi-marginal optimal transport distance and enables the combination of multiple and diverse state trajectories in the OT sense, providing a more sensible geometric average of the demonstrations. Our approach enables an agent to learn from several experts. We analyze its efficiency on OpenAI Gym control environments and show that the standard method is not always optimal.
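
To make the contrast between concatenation and an OT-style average concrete, here is a minimal toy sketch in Python. It is not the authors' algorithm: it assumes one-dimensional states and equal-sized sample sets, where the Wasserstein-2 barycenter has a closed form (the average of the sorted samples), and the function names and the two synthetic "experts" are hypothetical.

```python
import numpy as np


def concatenate_demos(demos):
    """Baseline described in the abstract: pool all expert samples into one set."""
    return np.concatenate(demos)


def barycenter_1d(demos):
    """Wasserstein-2 barycenter of equal-size 1D empirical distributions.

    In one dimension the barycenter's quantile function is the average of
    the inputs' quantile functions, so sorting each sample set and
    averaging element-wise yields the barycenter's support points.
    This closed form is an assumption of the toy setting only.
    """
    sorted_demos = np.stack([np.sort(d) for d in demos])
    return sorted_demos.mean(axis=0)


# Two hypothetical experts whose visited states cluster around different modes.
rng = np.random.default_rng(0)
expert_a = rng.normal(loc=-2.0, scale=0.1, size=200)
expert_b = rng.normal(loc=2.0, scale=0.1, size=200)

pooled = concatenate_demos([expert_a, expert_b])   # keeps both modes
averaged = barycenter_1d([expert_a, expert_b])     # single mode between them

print("pooled:   n=%d, std=%.2f" % (pooled.size, pooled.std()))
print("averaged: n=%d, std=%.2f" % (averaged.size, averaged.std()))
```

In this toy setting the pooled set stays bimodal with a large spread, while the barycenter is a single tight cluster located between the two experts. Which target suits a given task is exactly the kind of question the paper studies; the sketch only shows that the two ways of combining demonstrations produce different targets.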


