Towards Exploiting Geometry and Time for Fast Off-Distribution Adaptation in Multi-Task Robot Learning

06/24/2021
by K. R. Zentner, et al.

We explore possible methods for multi-task transfer learning which seek to exploit the shared physical structure of robotics tasks. Specifically, we train policies for a base set of pre-training tasks, then experiment with adapting to new off-distribution tasks, using simple architectural approaches for re-using these policies as black-box priors. These approaches include learning an alignment of either the observation space or action space from a base to a target task to exploit rigid body structure, and methods for learning a time-domain switching policy across base tasks which solves the target task, to exploit temporal coherence. We find that combining low-complexity target policy classes, base policies as black-box priors, and simple optimization algorithms allows us to acquire new tasks outside the base task distribution, using small amounts of offline training data.
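The two architectural ideas in the abstract, aligning the observation or action space around a base policy and switching between base policies over time, can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the affine form of the alignment, the fixed per-time-step schedule, the 2D toy policies `push_right` and `move_to_origin`, and all function names. The base policies are treated as black boxes and only called, never modified.

```python
import math
from typing import Callable, List, Sequence

Vector = List[float]
Policy = Callable[[Sequence[float]], Vector]


def affine(matrix: Sequence[Sequence[float]], offset: Sequence[float],
           x: Sequence[float]) -> Vector:
    """Return matrix @ x + offset using plain Python lists."""
    return [sum(m * xi for m, xi in zip(row, x)) + b
            for row, b in zip(matrix, offset)]


def align_policy(base_policy: Policy,
                 obs_matrix, obs_offset,
                 act_matrix, act_offset) -> Policy:
    """Wrap a black-box base policy with affine observation and action maps.

    Assumption: the alignment is affine. Only the four alignment
    parameters would be learned; the base policy stays fixed.
    """
    def policy(obs: Sequence[float]) -> Vector:
        base_obs = affine(obs_matrix, obs_offset, obs)      # target frame -> base frame
        base_action = base_policy(base_obs)                 # black-box call
        return affine(act_matrix, act_offset, base_action)  # base frame -> target frame
    return policy


def switching_policy(base_policies: Sequence[Policy],
                     schedule: Sequence[int]) -> Callable[[Sequence[float], int], Vector]:
    """Select a base policy per time step from a fixed schedule.

    Assumption: schedule[t] is the index of the base policy to run at
    step t, and the last entry is held once the schedule runs out.
    """
    def policy(obs: Sequence[float], t: int) -> Vector:
        index = schedule[min(t, len(schedule) - 1)]
        return base_policies[index](obs)
    return policy


def rotation_2d(angle: float) -> List[List[float]]:
    """2D rotation matrix, a simple example of a rigid-body alignment."""
    c, s = math.cos(angle), math.sin(angle)
    return [[c, -s], [s, c]]


# Hypothetical base policies for a 2D point robot.
def push_right(obs: Sequence[float]) -> Vector:
    return [1.0, 0.0]


def move_to_origin(obs: Sequence[float]) -> Vector:
    return [-obs[0], -obs[1]]


if __name__ == "__main__":
    identity, zero = rotation_2d(0.0), [0.0, 0.0]
    # Re-use "push right" for a target task that needs a push along +y
    # by rotating only the action space by 90 degrees.
    push_up = align_policy(push_right, identity, zero,
                           rotation_2d(math.pi / 2), zero)
    print(push_up([0.3, -0.2]))

    # Push for two steps, then return to the origin.
    combined = switching_policy([push_right, move_to_origin], [0, 0, 1])
    print([combined([2.0, 3.0], t) for t in range(4)])
```

In both wrappers the target policy class is small (a few matrix entries or a short list of indices), which is the sense in which a low-complexity policy class could plausibly be fit from small amounts of offline data with a simple optimizer. The optimization step itself is not shown here.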

