Near-Optimal Representation Learning for Hierarchical Reinforcement Learning

10/02/2018
by Ofir Nachum, et al.

We study the problem of representation learning in goal-conditioned hierarchical reinforcement learning. In such hierarchical structures, a higher-level controller solves tasks by iteratively communicating goals which a lower-level policy is trained to reach. Accordingly, the choice of representation -- the mapping of observation space to goal space -- is crucial. To study this problem, we develop a notion of sub-optimality of a representation, defined in terms of expected reward of the optimal hierarchical policy using this representation. We derive expressions which bound the sub-optimality and show how these expressions can be translated to representation learning objectives which may be optimized in practice. Results on a number of difficult continuous-control tasks show that our approach to representation learning yields qualitatively better representations as well as quantitatively better hierarchical policies, compared to existing methods (see videos at https://sites.google.com/view/representation-hrl).
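
To make the setup concrete, the following is a minimal Python sketch of the goal-conditioned hierarchy the abstract describes; it is not the authors' implementation. Everything in it is an illustrative assumption: the names `representation`, `higher_level_policy`, `lower_level_policy`, `env_step`, `intrinsic_reward` and `rollout` are hypothetical, the representation is a hand-picked projection onto the first two observation coordinates rather than a learned mapping, both policies are hand-coded placeholders rather than trained networks, and the lower-level reward is assumed to be the negative distance to the goal in goal space.

```python
import math
import random


def representation(observation):
    """Hypothetical mapping f from observation space to goal space.

    Assumption: the first two observation entries are an (x, y) position,
    and the goal space is that position. A learned f would replace this.
    """
    return observation[:2]


def higher_level_policy(observation, rng):
    """Placeholder controller: propose a goal near the current point in goal space."""
    x, y = representation(observation)
    return (x + rng.uniform(-1.0, 1.0), y + rng.uniform(-1.0, 1.0))


def lower_level_policy(observation, goal, step_size=0.1):
    """Placeholder low-level policy: step straight toward the goal in goal space."""
    x, y = representation(observation)
    dx, dy = goal[0] - x, goal[1] - y
    dist = math.hypot(dx, dy)
    if dist < 1e-8:
        return (0.0, 0.0)
    scale = min(step_size, dist) / dist
    return (dx * scale, dy * scale)


def env_step(observation, action):
    """Toy dynamics: the position moves by the action; other features are unchanged."""
    x, y, *rest = observation
    return (x + action[0], y + action[1], *rest)


def intrinsic_reward(next_observation, goal):
    """Assumed low-level reward: negative distance to the goal, measured in goal space."""
    gx, gy = representation(next_observation)
    return -math.hypot(goal[0] - gx, goal[1] - gy)


def rollout(observation, num_steps=30, goal_interval=10, seed=0):
    """Run the hierarchy: a new goal every `goal_interval` steps, low-level actions in between."""
    rng = random.Random(seed)
    goal = None
    rewards = []
    for t in range(num_steps):
        if t % goal_interval == 0:
            goal = higher_level_policy(observation, rng)
        action = lower_level_policy(observation, goal)
        observation = env_step(observation, action)
        rewards.append(intrinsic_reward(observation, goal))
    return observation, rewards


if __name__ == "__main__":
    final_observation, rewards = rollout((0.0, 0.0, 5.0))
    print(final_observation, rewards[-1])
```

In this sketch the third observation entry is invisible to `representation`, so no goal can ever ask the lower-level policy to change it. This is the kind of information loss that makes the choice of representation matter for what the hierarchy as a whole can achieve.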

research
05/31/2021

Efficient Hierarchical Exploration with Stable Subgoal Representation Learning

Goal-conditioned hierarchical reinforcement learning (HRL) serves as a s...
research
11/19/2018

Learning Actionable Representations with Goal-Conditioned Policies

Representation learning is a central challenge across a range of machine...
research
05/04/2019

Hierarchical Policy Learning is Sensitive to Goal Space Design

Hierarchy in reinforcement learning agents allows for control at multipl...
research
11/22/2018

Learning Goal Embeddings via Self-Play for Hierarchical Reinforcement Learning

In hierarchical reinforcement learning a major challenge is determining ...
research
06/13/2019

Sub-policy Adaptation for Hierarchical Reinforcement Learning

Hierarchical Reinforcement Learning is a promising approach to long-hori...
research
04/09/2018

Latent Space Policies for Hierarchical Reinforcement Learning

We address the problem of learning hierarchical deep neural network poli...
research
06/01/2013

Universal Induction with Varying Sets of Combinators

Universal induction is a crucial issue in AGI. Its practical applicabili...
