RT-1: Robotics Transformer for Real-World Control at Scale

12/13/2022
∙
by   Anthony Brohan, et al.
∙
0
∙

By transferring knowledge from large, diverse, task-agnostic datasets, modern machine learning models can solve specific downstream tasks either zero-shot or with small task-specific datasets to a high level of performance. While this capability has been demonstrated in other fields such as computer vision, natural language processing or speech recognition, it remains to be shown in robotics, where the generalization capabilities of the models are particularly critical due to the difficulty of collecting real-world robotic data. We argue that one of the keys to the success of such general robotic models lies with open-ended task-agnostic training, combined with high-capacity architectures that can absorb all of the diverse, robotic data. In this paper, we present a model class, dubbed Robotics Transformer, that exhibits promising scalable model properties. We verify our conclusions in a study of different model classes and their ability to generalize as a function of the data size, model size, and data diversity based on a large-scale data collection on real robots performing real-world tasks. The project's website and videos can be found at robotics-transformer.github.io

READ FULL TEXT

page 2

page 5

page 9

page 11

page 22

page 24

page 26

page 31

research
∙ 03/31/2021

Learning Generalizable Robotic Reward Functions from "In-The-Wild" Human Videos

We are motivated by the goal of generalist robots that can complete a wi...
research
∙ 09/18/2023

Prompt a Robot to Walk with Large Language Models

Large language models (LLMs) pre-trained on vast internet-scale data hav...
research
∙ 12/09/2022

PATO: Policy Assisted TeleOperation for Scalable Robot Data Collection

Large-scale data is an essential component of machine learning as demons...
research
∙ 09/16/2023

Pour me a drink: Robotic Precision Pouring Carbonated Beverages into Transparent Containers

With the growing emphasis on the development and integration of service ...
research
∙ 10/06/2022

Generalization Properties of Retrieval-based Models

Many modern high-performing machine learning models such as GPT-3 primar...
research
∙ 06/02/2023

Unifying (Machine) Vision via Counterfactual World Modeling

Leading approaches in machine vision employ different architectures for ...
research
∙ 06/02/2023

Probabilistic Adaptation of Text-to-Video Models

Large text-to-video models trained on internet-scale data have demonstra...

Please sign up or login with your details

Forgot password? Click here to reset