Visual Navigation by Generating Next Expected Observations

06/17/2019
by   Qiaoyun Wu, et al.
5

We propose a novel approach to visual navigation in unknown environments where the agent is guided by conceiving the next observations it expects to see after taking the next best action. This is achieved by learning a variational Bayesian model that generates the next expected observations (NEO) conditioned on the current observations of the agent and the target view. Our approach predicts the next best action based on the current observation and NEO. Our generative model is learned through optimizing a variational objective encompassing two key designs. First, the latent distribution is conditioned on current observations and target view, supporting model-based, target-driven navigation. Second, the latent space is modeled with a Mixture of Gaussians conditioned on the current observation and next best action. Our use of mixture-of-posteriors prior effectively alleviates the issue of over-regularized latent space, thus facilitating model generalization in novel scenes. Moreover, the NEO generation models the forward dynamics of the agent-environment interaction, which improves the quality of approximate inference and hence benefits data efficiency. We have conducted extensive evaluations on both real-world and synthetic benchmarks, and show that our model outperforms the state-of-the-art RL-based methods significantly in terms of success rate, data efficiency, and cross-scene generalization.

READ FULL TEXT
research
12/09/2019

Reinforcement Learning based Visual Navigation with Information-Theoretic Regularization

We present a target-driven navigation approach for improving the cross-t...
research
09/30/2020

Towards Target-Driven Visual Navigation in Indoor Scenes via Generative Imitation Learning

We present a target-driven navigation system to improve mapless visual n...
research
01/18/2021

Mind the Gap when Conditioning Amortised Inference in Sequential Latent-Variable Models

Amortised inference enables scalable learning of sequential latent-varia...
research
02/22/2020

Emergent Communication with World Models

We introduce Language World Models, a class of language-conditional gene...
research
08/09/2021

Mapless Humanoid Navigation Using Learned Latent Dynamics

In this paper, we propose a novel Deep Reinforcement Learning approach t...
research
07/15/2022

Outcome-Guided Counterfactuals for Reinforcement Learning Agents from a Jointly Trained Generative Latent Space

We present a novel generative method for producing unseen and plausible ...
research
04/25/2018

Generative Temporal Models with Spatial Memory for Partially Observed Environments

In model-based reinforcement learning, generative and temporal models of...

Please sign up or login with your details

Forgot password? Click here to reset