Learning Quickly to Plan Quickly Using Modular Meta-Learning

09/20/2018
by   Rohan Chitnis, et al.
0

Multi-object manipulation problems in continuous state and action spaces can be solved by planners that search over sampled values for the continuous parameters of operators. The efficiency of these planners depends critically on the effectiveness of the samplers used, but effective sampling in turn depends on details of the robot, environment, and task. Our strategy is to learn functions called specializers that generate values for continuous operator parameters, given a state description and values for the discrete parameters. Rather than trying to learn a single specializer for each operator from large amounts of data on a single task, we take a modular meta-learning approach. We train on multiple tasks and learn a variety of specializers that, on a new task, can be quickly adapted using relatively little data -- thus, our system "learns quickly to plan quickly" using these specializers. We validate our approach experimentally in simulated 3D pick-and-place tasks with continuous state and action spaces.

READ FULL TEXT
research
03/25/2020

iTAML: An Incremental Task-Agnostic Meta-learning Approach

Humans can continuously learn new knowledge as their experience grows. I...
research
03/25/2021

A Meta-Reinforcement Learning Approach to Process Control

Meta-learning is a branch of machine learning which aims to quickly adap...
research
07/11/2017

Meta-Learning with Temporal Convolutions

Deep neural networks excel in regimes with large amounts of data, but te...
research
11/18/2021

Visual Goal-Directed Meta-Learning with Contextual Planning Networks

The goal of meta-learning is to generalize to new tasks and goals as qui...
research
12/19/2018

Modular meta-learning in abstract graph networks for combinatorial generalization

Modular meta-learning is a new framework that generalizes to unseen data...
research
03/04/2019

Model Primitive Hierarchical Lifelong Reinforcement Learning

Learning interpretable and transferable subpolicies and performing task ...
research
11/20/2017

Modular Continual Learning in a Unified Visual Environment

A core aspect of human intelligence is the ability to learn new tasks qu...

Please sign up or login with your details

Forgot password? Click here to reset