Scaling Up and Distilling Down: Language-Guided Robot Skill Acquisition

07/26/2023
by   Huy Ha, et al.
0

We present a framework for robot skill acquisition, which 1) efficiently scale up data generation of language-labelled robot data and 2) effectively distills this data down into a robust multi-task language-conditioned visuo-motor policy. For (1), we use a large language model (LLM) to guide high-level planning, and sampling-based robot planners (e.g. motion or grasp samplers) for generating diverse and rich manipulation trajectories. To robustify this data-collection process, the LLM also infers a code-snippet for the success condition of each task, simultaneously enabling the data-collection process to detect failure and retry as well as the automatic labeling of trajectories with success/failure. For (2), we extend the diffusion policy single-task behavior-cloning approach to multi-task settings with language conditioning. Finally, we propose a new multi-task benchmark with 18 tasks across five domains to test long-horizon behavior, common-sense reasoning, tool-use, and intuitive physics. We find that our distilled policy successfully learned the robust retrying behavior in its data collection policy, while improving absolute success rates by 34.8 benchmark, code, and qualitative results are on our website https://www.cs.columbia.edu/ huy/scalingup/

READ FULL TEXT

page 3

page 7

page 15

page 16

page 17

page 19

page 20

page 26

research
08/19/2023

Skill Transformer: A Monolithic Policy for Mobile Manipulation

We present Skill Transformer, an approach for solving long-horizon robot...
research
12/09/2022

PATO: Policy Assisted TeleOperation for Scalable Robot Data Collection

Large-scale data is an essential component of machine learning as demons...
research
09/19/2021

Lifelong Robotic Reinforcement Learning by Retaining Experiences

Multi-task learning ideally allows robots to acquire a diverse repertoir...
research
03/21/2023

Text2Motion: From Natural Language Instructions to Feasible Plans

We propose Text2Motion, a language-based planning framework enabling rob...
research
09/15/2021

Multi-Task Learning with Sequence-Conditioned Transporter Networks

Enabling robots to solve multiple manipulation tasks has a wide range of...
research
03/05/2019

Learning Latent Plans from Play

We propose learning from teleoperated play data (LfP) as a way to scale ...
research
07/31/2023

Generative models for wearables data

Data scarcity is a common obstacle in medical research due to the high c...

Please sign up or login with your details

Forgot password? Click here to reset