Mature GAIL: Imitation Learning for Low-level and High-dimensional Input using Global Encoder and Cost Transformation

09/07/2019
by   Wonsup Shin, et al.
0

Recently, GAIL framework and various variants have shown remarkable possibilities for solving practical MDP problems. However, detailed researches of low-level, and high-dimensional state input in this framework, such as image sequences, has not been conducted. Furthermore, the cost function learned in the traditional GAIL frame-work only lies on a negative range, acting as a non-penalized reward and making the agent difficult to learn the optimal policy. In this paper, we propose a new algorithm based on the GAIL framework that includes a global encoder and the reward penalization mechanism. The global encoder solves two issues that arise when applying GAIL framework to high-dimensional image state. Also, it is shown that the penalization mechanism provides more adequate reward to the agent, resulting in stable performance improvement. Our approach's potential can be backed up by the fact that it is generally applicable to variants of GAIL framework. We conducted in-depth experiments by applying our methods to various variants of the GAIL framework. And, the results proved that our method significantly improves the performances when it comes to low-level and high-dimensional tasks.

READ FULL TEXT

page 5

page 6

research
12/10/2019

Deep Bayesian Reward Learning from Preferences

Bayesian inverse reinforcement learning (IRL) methods are ideal for safe...
research
06/22/2022

Latent Policies for Adversarial Imitation Learning

This paper considers learning robot locomotion and manipulation tasks fr...
research
03/03/2020

Hierarchically Decoupled Imitation for Morphological Transfer

Learning long-range behaviors on complex high-dimensional agents is a fu...
research
06/26/2020

Intrinsic Reward Driven Imitation Learning via Generative Model

Imitation learning in a high-dimensional environment is challenging. Mos...
research
09/09/2019

Expert-Level Atari Imitation Learning from Demonstrations Only

One of the key issues for imitation learning lies in making policy learn...
research
10/04/2022

Structural Estimation of Markov Decision Processes in High-Dimensional State Space with Finite-Time Guarantees

We consider the task of estimating a structural model of dynamic decisio...
research
12/29/2021

DeepHAM: A Global Solution Method for Heterogeneous Agent Models with Aggregate Shocks

We propose an efficient, reliable, and interpretable global solution met...

Please sign up or login with your details

Forgot password? Click here to reset