Learning to Compose Skills

11/30/2017
by   Himanshu Sahni, et al.
0

We present a differentiable framework capable of learning a wide variety of compositions of simple policies that we call skills. By recursively composing skills with themselves, we can create hierarchies that display complex behavior. Skill networks are trained to generate skill-state embeddings that are provided as inputs to a trainable composition function, which in turn outputs a policy for the overall task. Our experiments on an environment consisting of multiple collect and evade tasks show that this architecture is able to quickly build complex skills from simpler ones. Furthermore, the learned composition function displays some transfer to unseen combinations of skills, allowing for zero-shot generalizations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/04/2018

Zero-Shot Skill Composition and Simulation-to-Real Transfer by Learning Task Representations

Simulation-to-real transfer is an important strategy for making reinforc...
research
01/18/2020

Developing and Validating an Interactive Training Tool for Inferring 2D Cross-Sections of Complex 3D Structures

Understanding 2D cross-sections of 3D structures is a crucial skill in m...
research
07/20/2020

Complex Skill Acquisition through Simple Skill Adversarial Imitation Learning

Humans are able to think of complex tasks as combinations of simpler sub...
research
11/15/2021

Adversarial Skill Chaining for Long-Horizon Robot Manipulation via Terminal State Regularization

Skill chaining is a promising approach for synthesizing complex behavior...
research
05/25/2022

Skill Machines: Temporal Logic Composition in Reinforcement Learning

A major challenge in reinforcement learning is specifying tasks in a man...
research
05/23/2019

MCP: Learning Composable Hierarchical Control with Multiplicative Compositional Policies

Humans are able to perform a myriad of sophisticated tasks by drawing up...
research
06/01/2022

Learning to Sequence and Blend Robot Skills via Differentiable Optimization

In contrast to humans and animals who naturally execute seamless motions...

Please sign up or login with your details

Forgot password? Click here to reset