Generative Adversarial Imitation Learning

06/10/2016
by   Jonathan Ho, et al.
0

Consider learning a policy from example expert behavior, without interaction with the expert or access to reinforcement signal. One approach is to recover the expert's cost function with inverse reinforcement learning, then extract a policy from that cost function with reinforcement learning. This approach is indirect and can be slow. We propose a new general framework for directly extracting a policy from data, as if it were obtained by reinforcement learning following inverse reinforcement learning. We show that a certain instantiation of our framework draws an analogy between imitation learning and generative adversarial networks, from which we derive a model-free imitation learning algorithm that obtains significant performance gains over existing model-free methods in imitating complex behaviors in large, high-dimensional environments.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/16/2019

Random Expert Distillation: Imitation Learning via Expert Policy Support Estimation

We consider the problem of imitation learning from a finite set of exper...
research
04/21/2018

Event Extraction with Generative Adversarial Imitation Learning

We propose a new method for event extraction (EE) task based on an imita...
research
05/27/2021

Generative Adversarial Imitation Learning for Empathy-based AI

Generative adversarial imitation learning (GAIL) is a model-free algorit...
research
04/04/2023

Quantum Imitation Learning

Despite remarkable successes in solving various complex decision-making ...
research
11/09/2020

Safe Trajectory Planning Using Reinforcement Learning for Self Driving

Self-driving vehicles must be able to act intelligently in diverse and d...
research
04/20/2020

Energy-Based Imitation Learning

We tackle a common scenario in imitation learning (IL), where agents try...
research
11/11/2016

A Connection between Generative Adversarial Networks, Inverse Reinforcement Learning, and Energy-Based Models

Generative adversarial networks (GANs) are a recently proposed class of ...

Please sign up or login with your details

Forgot password? Click here to reset