Scaling Robot Learning with Semantically Imagined Experience

02/22/2023
by Tianhe Yu, et al.

Recent advances in robot learning have shown promise in enabling robots to perform a variety of manipulation tasks and generalize to novel scenarios. One of the key contributing factors to this progress is the scale of robot data used to train the models. To obtain large-scale datasets, prior approaches have relied on either demonstrations requiring high human involvement or engineering-heavy autonomous data collection schemes, both of which are challenging to scale. To mitigate this issue, we propose an alternative route and leverage text-to-image foundation models widely used in computer vision and natural language processing to obtain meaningful data for robot learning without requiring additional robot data. We term our method Robot Learning with Semantically Imagined Experience (ROSIE). Specifically, we use state-of-the-art text-to-image diffusion models to perform aggressive data augmentation on top of our existing robotic manipulation datasets, inpainting various unseen objects for manipulation, backgrounds, and distractors with text guidance. Through extensive real-world experiments, we show that manipulation policies trained on data augmented this way are able to solve completely unseen tasks with new objects and behave more robustly with respect to novel distractors. In addition, we find that training with the diffusion-based data augmentation improves the robustness and generalization of high-level robot learning tasks such as success detection. The project's website and videos can be found at diffusion-rosie.github.io.
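To make the augmentation idea concrete, here is a minimal sketch of mask-based, text-guided inpainting applied to a recorded episode while the recorded actions are left untouched. It is an illustration under stated assumptions, not the authors' pipeline: the toy list-of-lists image type, the names `augment_episode` and `composite`, and the placeholder `toy_inpaint` (which stands in for a real text-to-image diffusion inpainting model) are all hypothetical.

```python
from typing import Callable, List, Tuple

Image = List[List[int]]   # toy grayscale image: rows of pixel values
Mask = List[List[int]]    # 1 = region to repaint, 0 = keep the original pixel
Step = Tuple[Image, str]  # (observation, action label)


def toy_inpaint(image: Image, mask: Mask, prompt: str) -> Image:
    """Hypothetical stand-in for a text-guided diffusion inpainting model.

    It returns a full image of one constant value derived from the prompt,
    so the effect of the mask is easy to see. A real model would generate
    content that matches the text.
    """
    fill = len(prompt) % 256
    return [[fill for _ in row] for row in image]


def composite(original: Image, generated: Image, mask: Mask) -> Image:
    """Take generated pixels inside the mask and original pixels elsewhere."""
    return [
        [g if m else o for o, g, m in zip(o_row, g_row, m_row)]
        for o_row, g_row, m_row in zip(original, generated, mask)
    ]


def augment_episode(
    episode: List[Step],
    masks: List[Mask],
    prompt: str,
    inpaint_fn: Callable[[Image, Mask, str], Image] = toy_inpaint,
) -> List[Step]:
    """Repaint the masked region of every frame; actions are copied as-is."""
    augmented = []
    for (image, action), mask in zip(episode, masks):
        generated = inpaint_fn(image, mask, prompt)
        augmented.append((composite(image, generated, mask), action))
    return augmented


# Usage: one 2x2 frame, with only the top-right pixel marked for repainting.
episode = [([[10, 10], [10, 10]], "move_left")]
masks = [[[0, 1], [0, 0]]]
augmented = augment_episode(episode, masks, "a blue microfiber cloth")
```

Keeping the action labels fixed while only the masked pixels change reflects the abstract's claim that new training data is obtained without collecting additional robot data. How the masks are produced is outside this sketch.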


