Robust Imitation Learning against Variations in Environment Dynamics

06/19/2022
by Jongseong Chae, et al.

In this paper, we propose a robust imitation learning (IL) framework that improves the robustness of IL when environment dynamics are perturbed. An existing IL framework trained in a single environment can fail catastrophically under perturbations in environment dynamics because it does not account for the possibility that the underlying environment dynamics may change. Our framework handles environments with varying dynamics by imitating multiple experts in sampled environment dynamics, which enhances robustness to general variations in environment dynamics. To robustly imitate the multiple sample experts, we minimize the risk with respect to the Jensen-Shannon divergence between the agent's policy and each of the sample experts. Numerical results show that our algorithm significantly improves robustness against dynamics perturbations compared to conventional IL baselines.
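
To make the divergence-based objective concrete, here is a minimal sketch, not the authors' implementation. It assumes the agent and each sample expert are summarized by discrete probability vectors over a shared support, which is a simplification the abstract does not state. Because the abstract does not specify the form of the risk, the sketch shows two hypothetical aggregations of the per-expert Jensen-Shannon divergences (worst case and average). The names `js_divergence` and `multi_expert_risk` are illustrative only.

```python
import numpy as np


def js_divergence(p, q):
    """Jensen-Shannon divergence (natural log) between two discrete distributions."""
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    # Normalize so that unnormalized counts are also accepted.
    p = p / p.sum()
    q = q / q.sum()
    m = 0.5 * (p + q)

    def kl(a, b):
        # Terms with a == 0 contribute nothing; b > 0 wherever a > 0 because b is the mixture.
        mask = a > 0
        return float(np.sum(a[mask] * np.log(a[mask] / b[mask])))

    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


def multi_expert_risk(agent, experts, mode="max"):
    """Aggregate per-expert JS divergences (hypothetical choices: 'max' or 'mean')."""
    divs = [js_divergence(agent, e) for e in experts]
    if mode == "max":
        return max(divs)
    if mode == "mean":
        return float(np.mean(divs))
    raise ValueError(f"unknown mode: {mode}")


# Assumed toy data: one agent distribution and experts from three sampled dynamics.
agent = [0.4, 0.4, 0.2]
experts = [[0.5, 0.3, 0.2], [0.3, 0.5, 0.2], [0.1, 0.1, 0.8]]
worst_case = multi_expert_risk(agent, experts, mode="max")
average = multi_expert_risk(agent, experts, mode="mean")
```

The worst-case aggregation penalizes the agent for its largest mismatch with any single expert, while the average treats all sampled dynamics equally; which of these, if either, corresponds to the risk used in the paper should be checked against the full text.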


