
Guided Imitation of Task and Motion Planning

12/06/2021
by Michael James McDonald, et al.
MIT
berkeley college

While modern policy optimization methods can perform complex manipulation from sensory data, they struggle on problems with extended time horizons and multiple sub-goals. Task and motion planning (TAMP) methods, on the other hand, scale to long horizons, but they are computationally expensive and need to precisely track world state. We propose a method that draws on the strengths of both: we train a policy to imitate a TAMP solver's output. This produces a feed-forward policy that can accomplish multi-step tasks from sensory data. First, we build an asynchronous distributed TAMP solver that can produce supervision data fast enough for imitation learning. Then, we propose a hierarchical policy architecture that lets us use partially trained control policies to speed up the TAMP solver. In robotic manipulation tasks with 7-DoF joint control, the partially trained policies reduce the time needed for planning by a factor of up to 2.6. Among these tasks, we can learn a policy that solves the RoboSuite 4-object pick-place task 88% of the time from pose observations and a policy that solves the RoboDesk 9-goal benchmark 79% of the time from RGB images (averaged across the 9 disparate tasks).
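
The core idea of training a policy to imitate a planner's output can be illustrated with a minimal sketch. It assumes a toy 1-D reaching task and is not the paper's implementation: `toy_planner` is a hypothetical stand-in for a TAMP solver, and `collect_demonstrations`, `train_policy`, and `rollout` are illustrative names. The asynchronous distributed solver and the hierarchical policy architecture described above are not modeled here.

```python
import random


def toy_planner(state, goal):
    """Hypothetical stand-in for a TAMP solver: returns the expert action."""
    return 0.5 * (goal - state)


def collect_demonstrations(num_samples, seed=0):
    """Query the planner on random (state, goal) pairs to build supervision data."""
    rng = random.Random(seed)
    data = []
    for _ in range(num_samples):
        state = rng.uniform(-1.0, 1.0)
        goal = rng.uniform(-1.0, 1.0)
        data.append(((state, goal), toy_planner(state, goal)))
    return data


def train_policy(data, epochs=50, lr=0.1):
    """Fit a linear policy a = w0*state + w1*goal + b by SGD on squared error."""
    w = [0.0, 0.0]
    b = 0.0
    for _ in range(epochs):
        for (state, goal), action in data:
            error = w[0] * state + w[1] * goal + b - action
            w[0] -= lr * error * state
            w[1] -= lr * error * goal
            b -= lr * error
    return w, b


def rollout(w, b, state, goal, steps=20):
    """Run the learned feed-forward policy without calling the planner."""
    for _ in range(steps):
        state += w[0] * state + w[1] * goal + b
    return state


demos = collect_demonstrations(200)
weights, bias = train_policy(demos)
final_state = rollout(weights, bias, state=-0.8, goal=0.6)
print(weights, bias, final_state)
```

Once trained, the policy is executed on its own: the planner is only needed to generate supervision, which is the property that makes the resulting policy feed-forward.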
