Hallucinative Topological Memory for Zero-Shot Visual Planning

02/27/2020
by Kara Liu, et al.

In visual planning (VP), an agent learns to plan goal-directed behavior from observations of a dynamical system obtained offline, e.g., images obtained from self-supervised robot interaction. Most previous work on VP approached the problem by planning in a learned latent space, resulting in low-quality visual plans and difficult training algorithms. Here, instead, we propose a simple VP method that plans directly in image space and displays competitive performance. We build on the semi-parametric topological memory (SPTM) method: image samples are treated as nodes in a graph, the graph connectivity is learned from image sequence data, and planning can be performed using conventional graph search methods. We propose two modifications to SPTM. First, we train an energy-based graph connectivity function using contrastive predictive coding, which admits stable training. Second, to allow zero-shot planning in new domains, we learn a conditional VAE model that generates images given a context of the domain, and use these hallucinated samples for building the connectivity graph and planning. We show that this simple approach significantly outperforms state-of-the-art VP methods, in terms of both plan interpretability and success rate when using the plan to guide a trajectory-following controller. Interestingly, our method can pick up non-trivial visual properties of objects, such as their geometry, and account for them in the plans.
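The graph-search step described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the names `build_graph`, `shortest_path`, and `toy_score` are hypothetical, the 1-D numbers stand in for hallucinated image samples, and the hand-written `toy_score` (assumed to return values in (0, 1]) stands in for the learned energy-based connectivity function. The search itself is plain Dijkstra, one example of a conventional graph search method.

```python
import heapq
import math


def build_graph(samples, score_fn, threshold):
    """Connect every ordered pair whose connectivity score clears the threshold."""
    graph = {i: [] for i in range(len(samples))}
    for i, a in enumerate(samples):
        for j, b in enumerate(samples):
            if i != j:
                score = score_fn(a, b)
                if score >= threshold:
                    # Assumption: score lies in (0, 1], so -log(score) >= 0.
                    # A more plausible transition gets a lower edge cost.
                    graph[i].append((j, -math.log(score)))
    return graph


def shortest_path(graph, start, goal):
    """Dijkstra's algorithm; returns node indices from start to goal, or None."""
    best = {start: 0.0}
    parent = {}
    frontier = [(0.0, start)]
    while frontier:
        cost, node = heapq.heappop(frontier)
        if node == goal:
            # Walk parent pointers back to the start, then reverse.
            path = [node]
            while node in parent:
                node = parent[node]
                path.append(node)
            return path[::-1]
        if cost > best.get(node, math.inf):
            continue  # stale queue entry
        for nxt, weight in graph[node]:
            new_cost = cost + weight
            if new_cost < best.get(nxt, math.inf):
                best[nxt] = new_cost
                parent[nxt] = node
                heapq.heappush(frontier, (new_cost, nxt))
    return None


def toy_score(a, b):
    """Hypothetical stand-in for a learned connectivity score."""
    return math.exp(-abs(a - b))


# Toy 1-D "images"; the last sample is far from all others.
samples = [0.0, 1.0, 2.0, 3.0, 10.0]
graph = build_graph(samples, toy_score, threshold=0.3)
plan = shortest_path(graph, start=0, goal=3)
unreachable = shortest_path(graph, start=0, goal=4)
```

In this toy setup only neighboring samples are connected, so the plan visits each intermediate sample in order, and the isolated sample cannot be reached. In a visual planning setting, the returned node sequence would correspond to a sequence of images for a controller to follow.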

research
07/24/2018

Learning Plannable Representations with Causal InfoGAN

In recent years, deep generative models have been shown to 'imagine' con...
research
06/20/2023

Plausibility-Based Heuristics for Latent Space Classical Planning

Recent work on LatPlan has shown that it is possible to learn models for...
research
05/11/2019

Learning Robotic Manipulation through Visual Planning and Acting

Planning for robotic manipulation requires reasoning about the changes a...
research
03/01/2018

Semi-parametric Topological Memory for Navigation

We introduce a new memory architecture for navigation in previously unse...
research
11/03/2022

Sequence-Based Plan Feasibility Prediction for Efficient Task and Motion Planning

Robots planning long-horizon behavior in complex environments must be ab...
research
05/02/2023

Multimodal Procedural Planning via Dual Text-Image Prompting

Embodied agents have achieved prominent performance in following human i...
research
10/17/2019

Scoring-Aggregating-Planning: Learning task-agnostic priors from interactions and sparse rewards for zero-shot generalization

Humans can learn task-agnostic priors from interactive experience and ut...
