Learning Task Decomposition with Ordered Memory Policy Network

03/19/2021
by Yuchen Lu, et al.

Many complex real-world tasks are composed of several levels of sub-tasks. Humans leverage these hierarchical structures to accelerate the learning process and achieve better generalization. In this work, we study the inductive bias and propose the Ordered Memory Policy Network (OMPN) to discover subtask hierarchy by learning from demonstration. The discovered subtask hierarchy can be used to perform task decomposition, recovering the subtask boundaries in an unstructured demonstration. Experiments on Craft and Dial demonstrate that our model achieves higher task decomposition performance than strong baselines under both unsupervised and weakly supervised settings. OMPN can also be directly applied to partially observable environments and still achieve higher task decomposition performance. Our visualization further confirms that the subtask hierarchy can emerge in our model.

03/02/2018

TACO: Learning Task Decomposition via Temporal Alignment for Control

Many advanced Learning from Demonstration (LfD) methods consider the dec...
11/26/2020

Totally and Partially Ordered Hierarchical Planners in PDDL4J Library

In this paper, we outline the implementation of the TFD (Totally Ordered...
10/06/2020

Unsupervised Hierarchical Concept Learning

Discovering concepts (or temporal abstractions) in an unsupervised manne...
05/19/2023

Characterizing tradeoffs between teaching via language and demonstrations in multi-agent systems

Humans teach others about the world through language and demonstration. ...
06/22/2022

Learning Representations for Control with Hierarchical Forward Models

Learning control from pixels is difficult for reinforcement learning (RL...
04/28/2023

Unsupervised Discovery of 3D Hierarchical Structure with Generative Diffusion Features

Inspired by recent findings that generative diffusion models learn seman...
05/01/2023

Model-agnostic Measure of Generalization Difficulty

The measure of a machine learning algorithm is the difficulty of the tas...