Discovering hierarchies using Imitation Learning from hierarchy aware policies

12/01/2018
by   Ameet Deshpande, et al.
0

Learning options that allow agents to exhibit temporally higher order behavior has proven to be useful in increasing exploration, reducing sample complexity and for various transfer scenarios. Deep Discovery of Options (DDO) is a generative algorithm that learns a hierarchical policy along with options directly from expert trajectories. We perform a qualitative and quantitative analysis of options inferred from DDO in different domains. To this end, we suggest different value metrics like option termination condition, hinge value function error and KL-Divergence based distance metric to compare different methods. Analyzing the termination condition of the options and number of time steps the options were run revealed that the options were terminating prematurely. We suggest modifications which can be incorporated easily and alleviates the problem of shorter options and a collapse of options to the same mode.

READ FULL TEXT

page 10

page 12

research
10/06/2020

Diverse Exploration via InfoMax Options

In this paper, we study the problem of autonomously discovering temporal...
research
11/10/2017

Learning with Options that Terminate Off-Policy

A temporally abstract action, or an option, is specified by a policy and...
research
02/26/2019

The Termination Critic

In this work, we consider the problem of autonomously discovering behavi...
research
10/03/2022

Interpretable Option Discovery using Deep Q-Learning and Variational Autoencoders

Deep Reinforcement Learning (RL) is unquestionably a robust framework to...
research
01/19/2020

Learning Options from Demonstration using Skill Segmentation

We present a method for learning options from segmented demonstration tr...
research
09/05/2022

MO2: Model-Based Offline Options

The ability to discover useful behaviours from past experience and trans...
research
08/22/2019

A Decomposition and Metric-Based Evaluation Framework for Microservices

Migrating from monolithic systems into microservice is a very complex ta...

Please sign up or login with your details

Forgot password? Click here to reset