A Pragmatic Look at Deep Imitation Learning

08/04/2021
by Kai Arulkumaran, et al.

The introduction of the generative adversarial imitation learning (GAIL) algorithm has spurred the development of scalable imitation learning approaches using deep neural networks. The GAIL objective can be thought of as 1) matching the expert policy's state distribution; 2) penalising the learned policy's state distribution; and 3) maximising entropy. While theoretically motivated, in practice GAIL can be difficult to apply, not least due to the instabilities of adversarial training. In this paper, we take a pragmatic look at GAIL and related imitation learning algorithms. We implement and automatically tune a range of algorithms in a unified experimental setup, presenting a fair evaluation between the competing methods. From our results, our primary recommendation is to consider non-adversarial methods. Furthermore, we discuss the common components of imitation learning objectives, and present promising avenues for future research.
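
For orientation, the three-part reading of the objective can be set against the saddle-point form in which GAIL is usually written. This is a sketch under assumptions rather than the paper's own derivation: it follows the common convention in which the discriminator $D$ outputs high values on learner samples, and the symbols $\pi_E$ (expert policy), $\lambda$ (entropy weight) and $H$ (causal entropy) do not appear in the abstract. It is also written over state-action pairs, whereas the abstract speaks of state distributions.

```latex
\min_{\pi} \max_{D} \;
  \underbrace{\mathbb{E}_{(s,a) \sim \pi_E}\big[\log\big(1 - D(s,a)\big)\big]}_{\text{1) match the expert's distribution}}
  + \underbrace{\mathbb{E}_{(s,a) \sim \pi}\big[\log D(s,a)\big]}_{\text{2) penalise the learner's distribution}}
  - \underbrace{\lambda H(\pi)}_{\text{3) maximise entropy}}
```

Under this reading, the first expectation is the term associated with the expert's visitation distribution, the second penalises the samples the learner itself visits, and the final term rewards higher policy entropy. The adversarial inner maximisation over $D$ is where the training instabilities mentioned above arise.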

research
08/08/2020

Non-Adversarial Imitation Learning and its Connections to Adversarial Methods

Many modern methods for imitation learning and inverse reinforcement lea...
research
06/28/2017

Energy-Based Sequence GANs for Recommendation and Their Connection to Imitation Learning

Recommender systems aim to find an accurate and efficient mapping from h...
research
02/09/2022

Imitation Learning by State-Only Distribution Matching

Imitation Learning from observation describes policy learning in a simil...
research
04/29/2023

A Coupled Flow Approach to Imitation Learning

In reinforcement learning and imitation learning, an object of central i...
research
04/04/2023

Quantum Imitation Learning

Despite remarkable successes in solving various complex decision-making ...
research
12/05/2017

Multimodal Storytelling via Generative Adversarial Imitation Learning

Deriving event storylines is an effective summarization method to succin...
research
03/04/2021

Of Moments and Matching: Trade-offs and Treatments in Imitation Learning

We provide a unifying view of a large family of previous imitation learn...
