Learning World Graphs to Accelerate Hierarchical Reinforcement Learning

07/01/2019
by Wenling Shang, et al.

In many real-world scenarios, an autonomous agent encounters various tasks within a single complex environment. We propose to build a graph abstraction over the environment structure to accelerate the learning of these tasks. Here, nodes are important points of interest (pivotal states) and edges represent feasible traversals between them. Our approach has two stages. First, we jointly train a latent pivotal state model and a curiosity-driven goal-conditioned policy in a task-agnostic manner. Second, provided with the information from the world graph, a high-level Manager quickly finds solutions to new tasks and expresses subgoals to a low-level Worker in reference to pivotal states. The Worker can then also leverage the graph to traverse easily to the pivotal states of interest, even across long distances, and to explore non-locally. We perform a thorough ablation study to evaluate our approach on a suite of challenging maze tasks, demonstrating significant advantages in performance and efficiency for the proposed framework over baselines that lack world graph knowledge.
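
To make the graph abstraction concrete, the following is a minimal sketch of how a world graph over pivotal states might be stored and searched. It is not the authors' implementation: the maze coordinates, the `WORLD_GRAPH` dictionary, and the `plan_over_pivotal_states` helper are all hypothetical, and plain breadth-first search stands in for whatever traversal procedure the paper actually uses.

```python
from collections import deque

# Hypothetical world graph: nodes are pivotal states (here, maze cells given as
# (row, col)), and each adjacency entry records a feasible traversal.
WORLD_GRAPH = {
    (0, 0): [(0, 3)],
    (0, 3): [(0, 0), (2, 3)],
    (2, 3): [(0, 3), (4, 1), (4, 5)],
    (4, 1): [(2, 3)],
    (4, 5): [(2, 3)],
}


def plan_over_pivotal_states(graph, start, goal):
    """Breadth-first search; returns a shortest node sequence or None."""
    parents = {start: None}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node == goal:
            # Walk the parent links back to the start, then reverse.
            path = []
            while node is not None:
                path.append(node)
                node = parents[node]
            return path[::-1]
        for neighbor in graph.get(node, []):
            if neighbor not in parents:
                parents[neighbor] = node
                queue.append(neighbor)
    return None  # goal is not reachable through known traversals


route = plan_over_pivotal_states(WORLD_GRAPH, (0, 0), (4, 5))
print(route)  # [(0, 0), (0, 3), (2, 3), (4, 5)]
```

In this sketch, a Worker asked to reach a distant pivotal state would follow `route` one node at a time, using its goal-conditioned policy for each hop rather than exploring the whole maze locally.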
