Learning to Generalize Across Long-Horizon Tasks from Human Demonstrations

03/13/2020
by   Ajay Mandlekar, et al.
3

Imitation learning is an effective and safe technique to train robot policies in the real world because it does not depend on an expensive random exploration process. However, due to the lack of exploration, learning policies that generalize beyond the demonstrated behaviors is still an open challenge. We present a novel imitation learning framework to enable robots to 1) learn complex real world manipulation tasks efficiently from a small number of human demonstrations, and 2) synthesize new behaviors not contained in the collected demonstrations. Our key insight is that multi-task domains often present a latent structure, where demonstrated trajectories for different tasks intersect at common regions of the state space. We present Generalization Through Imitation (GTI), a two-stage offline imitation learning algorithm that exploits this intersecting structure to train goal-directed policies that generalize to unseen start and goal state combinations. In the first stage of GTI, we train a stochastic policy that leverages trajectory intersections to have the capacity to compose behaviors from different demonstration trajectories together. In the second stage of GTI, we collect a small set of rollouts from the unconditioned stochastic policy of the first stage, and train a goal-directed agent to generalize to novel start and goal configurations. We validate GTI in both simulated domains and a challenging long-horizon robotic manipulation domain in the real world. Additional results and videos are available at https://sites.google.com/view/gti2020/ .

READ FULL TEXT

page 1

page 2

page 5

page 6

page 7

page 8

research
09/28/2021

Bottom-Up Skill Discovery from Unsegmented Demonstrations for Long-Horizon Robot Manipulation

We tackle real-world long-horizon robot manipulation tasks through skill...
research
10/25/2019

Relay Policy Learning: Solving Long-Horizon Tasks via Imitation and Reinforcement Learning

We present relay policy learning, a method for imitation and reinforceme...
research
07/26/2023

Waypoint-Based Imitation Learning for Robotic Manipulation

While imitation learning methods have seen a resurgent interest for robo...
research
10/16/2018

Multiple Interactions Made Easy (MIME): Large Scale Demonstrations Data for Imitation

In recent years, we have seen an emergence of data-driven approaches in ...
research
12/05/2022

Accelerating Interactive Human-like Manipulation Learning with GPU-based Simulation and High-quality Demonstrations

Dexterous manipulation with anthropomorphic robot hands remains a challe...
research
12/09/2021

Error-Aware Imitation Learning from Teleoperation Data for Mobile Manipulation

In mobile manipulation (MM), robots can both navigate within and interac...
research
10/14/2022

Abstract-to-Executable Trajectory Translation for One-Shot Task Generalization

Training long-horizon robotic policies in complex physical environments ...

Please sign up or login with your details

Forgot password? Click here to reset