Guided Imitation of Task and Motion Planning

12/06/2021
by   Michael James McDonald, et al.
0

While modern policy optimization methods can do complex manipulation from sensory data, they struggle on problems with extended time horizons and multiple sub-goals. On the other hand, task and motion planning (TAMP) methods scale to long horizons but they are computationally expensive and need to precisely track world state. We propose a method that draws on the strength of both methods: we train a policy to imitate a TAMP solver's output. This produces a feed-forward policy that can accomplish multi-step tasks from sensory data. First, we build an asynchronous distributed TAMP solver that can produce supervision data fast enough for imitation learning. Then, we propose a hierarchical policy architecture that lets us use partially trained control policies to speed up the TAMP solver. In robotic manipulation tasks with 7-DoF joint control, the partially trained policies reduce the time needed for planning by a factor of up to 2.6. Among these tasks, we can learn a policy that solves the RoboSuite 4-object pick-place task 88 pose observations and a policy that solves the RoboDesk 9-goal benchmark 79 the time from RGB images (averaged across the 9 disparate tasks).

READ FULL TEXT
research
05/25/2023

Imitating Task and Motion Planning with Visuomotor Transformers

Imitation learning is a powerful tool for training robot manipulation po...
research
10/25/2022

Policy-Guided Lazy Search with Feedback for Task and Motion Planning

PDDLStream solvers have recently emerged as viable solutions for Task an...
research
03/25/2021

Self-Imitation Learning by Planning

Imitation learning (IL) enables robots to acquire skills quickly by tran...
research
02/22/2023

A Supervisory Learning Control Framework for Autonomous Real-time Task Planning for an Underactuated Cooperative Robotic task

We introduce a framework for cooperative manipulation, applied on an und...
research
05/17/2022

Self-Supervised Learning of Multi-Object Keypoints for Robotic Manipulation

In recent years, policy learning methods using either reinforcement or i...
research
11/06/2020

Task-relevant Representation Learning for Networked Robotic Perception

Today, even the most compute-and-power constrained robots can measure co...
research
08/04/2021

Tolerance-Guided Policy Learning for Adaptable and Transferrable Delicate Industrial Insertion

Policy learning for delicate industrial insertion tasks (e.g., PC board ...

Please sign up or login with your details

Forgot password? Click here to reset