Hierarchical Imitation Learning with Vector Quantized Models

01/30/2023
by   Kalle Kujanpää, et al.
0

The ability to plan actions on multiple levels of abstraction enables intelligent agents to solve complex tasks effectively. However, learning the models for both low and high-level planning from demonstrations has proven challenging, especially with higher-dimensional inputs. To address this issue, we propose to use reinforcement learning to identify subgoals in expert trajectories by associating the magnitude of the rewards with the predictability of low-level actions given the state and the chosen subgoal. We build a vector-quantized generative model for the identified subgoals to perform subgoal-level planning. In experiments, the algorithm excels at solving complex, long-horizon decision-making problems outperforming state-of-the-art. Because of its ability to plan, our algorithm can find better trajectories than the ones in the training set

READ FULL TEXT

page 3

page 5

page 9

page 15

page 19

page 21

research
10/09/2021

Interactive Hierarchical Guidance using Language

Reinforcement learning has been successful in many tasks ranging from ro...
research
06/10/2023

PEAR: Primitive enabled Adaptive Relabeling for boosting Hierarchical Reinforcement Learning

Hierarchical reinforcement learning (HRL) has the potential to solve com...
research
07/03/2019

Integration of Imitation Learning using GAIL and Reinforcement Learning using Task-achievement Rewards via Probabilistic Generative Model

Integration of reinforcement learning and imitation learning is an impor...
research
08/19/2022

Home Run: Finding Your Way Home by Imagining Trajectories

When studying unconstrained behaviour and allowing mice to leave their c...
research
07/29/2013

Levels of Integration between Low-Level Reasoning and Task Planning

We provide a systematic analysis of levels of integration between discre...
research
03/01/2018

Composable Planning with Attributes

The tasks that an agent will need to solve often are not known during tr...
research
02/21/2022

CCPT: Automatic Gameplay Testing and Validation with Curiosity-Conditioned Proximal Trajectories

This paper proposes a novel deep reinforcement learning algorithm to per...

Please sign up or login with your details

Forgot password? Click here to reset