Directed-Info GAIL: Learning Hierarchical Policies from Unsegmented Demonstrations using Directed Information

09/29/2018
by   Arjun Sharma, et al.
0

The use of imitation learning to learn a single policy for a complex task that has multiple modes or hierarchical structure can be challenging. In fact, previous work has shown that when the modes are known, learning separate policies for each mode or sub-task can greatly improve the performance of imitation learning. In this work, we discover the interaction between sub-tasks from their resulting state-action trajectory sequences using a directed graphical model. We propose a new algorithm based on the generative adversarial imitation learning framework which automatically learns sub-task policies from unsegmented demonstrations. Our approach maximizes the directed information flow in the graphical model between sub-task latent variables and their generated trajectories. We also show how our approach connects with the existing Options framework, which is commonly used to learn hierarchical policies.

READ FULL TEXT

page 8

page 13

research
08/14/2018

Shared Multi-Task Imitation Learning for Indoor Self-Navigation

Deep imitation learning enables robots to learn from expert demonstratio...
research
12/29/2019

Hierarchical Variational Imitation Learning of Control Programs

Autonomous agents can learn by imitating teacher demonstrations of the i...
research
03/01/2019

GRP Model for Sensorimotor Learning

Learning from complex demonstrations is challenging, especially when the...
research
06/07/2023

Divide and Repair: Using Options to Improve Performance of Imitation Learning Against Adversarial Demonstrations

We consider the problem of learning to perform a task from demonstration...
research
10/07/2020

Provable Hierarchical Imitation Learning via EM

Due to recent empirical successes, the options framework for hierarchica...
research
05/06/2022

Diverse Imitation Learning via Self-Organizing Generative Models

Imitation learning is the task of replicating expert policy from demonst...
research
06/01/2021

What Matters for Adversarial Imitation Learning?

Adversarial imitation learning has become a popular framework for imitat...

Please sign up or login with your details

Forgot password? Click here to reset