Behavior Retrieval: Few-Shot Imitation Learning by Querying Unlabeled Datasets

04/18/2023
by   Maximilian Du, et al.
4

Enabling robots to learn novel visuomotor skills in a data-efficient manner remains an unsolved problem with myriad challenges. A popular paradigm for tackling this problem is through leveraging large unlabeled datasets that have many behaviors in them and then adapting a policy to a specific task using a small amount of task-specific human supervision (i.e. interventions or demonstrations). However, how best to leverage the narrow task-specific supervision and balance it with offline data remains an open question. Our key insight in this work is that task-specific data not only provides new data for an agent to train on but can also inform the type of prior data the agent should use for learning. Concretely, we propose a simple approach that uses a small amount of downstream expert data to selectively query relevant behaviors from an offline, unlabeled dataset (including many sub-optimal behaviors). The agent is then jointly trained on the expert and queried data. We observe that our method learns to query only the relevant transitions to the task, filtering out sub-optimal or task-irrelevant data. By doing so, it is able to learn more effectively from the mix of task-specific and offline data compared to naively mixing the data or only using the task-specific data. Furthermore, we find that our simple querying approach outperforms more complex goal-conditioned methods by 20 https://sites.google.com/view/behaviorretrieval for videos and code.

READ FULL TEXT

page 1

page 3

page 4

page 5

page 6

page 7

page 8

page 14

research
11/27/2020

Offline Learning from Demonstrations and Unlabeled Experience

Behavior cloning (BC) is often practical for robot learning because it a...
research
03/28/2022

Modular Adaptive Policy Selection for Multi-Task Imitation Learning through Task Division

Deep imitation learning requires many expert demonstrations, which can b...
research
04/26/2023

Distance Weighted Supervised Learning for Offline Interaction Data

Sequential decision making algorithms often struggle to leverage differe...
research
07/19/2021

Playful Interactions for Representation Learning

One of the key challenges in visual imitation learning is collecting lar...
research
01/19/2022

Improving Behavioural Cloning with Human-Driven Dynamic Dataset Augmentation

Behavioural cloning has been extensively used to train agents and is rec...
research
06/09/2021

Pretraining Representations for Data-Efficient Reinforcement Learning

Data efficiency is a key challenge for deep reinforcement learning. We a...
research
02/21/2023

Inferring Implicit Trait Preferences for Task Allocation in Heterogeneous Teams

Task allocation in heterogeneous multi-agent teams often requires reason...

Please sign up or login with your details

Forgot password? Click here to reset