Generative Temporal Models with Spatial Memory for Partially Observed Environments

04/25/2018
by Marco Fraccaro, et al.

In model-based reinforcement learning, generative and temporal models of environments can be leveraged to boost agent performance, either by tuning the agent's representations during training or by serving as part of an explicit planning mechanism. However, their application in practice has been limited to simplistic environments, due to the difficulty of training such models in larger, potentially partially observed and 3D environments. In this work, we introduce a novel action-conditioned generative model of such challenging environments. The model features a non-parametric spatial memory system in which we store learned, disentangled representations of the environment. Low-dimensional spatial updates are computed using a state-space model that makes use of knowledge of the prior dynamics of the moving agent, and high-dimensional visual observations are modelled with a Variational Auto-Encoder. The result is a scalable architecture capable of performing coherent predictions over hundreds of time steps across a range of partially observed 2D and 3D environments.
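
To make the idea of a non-parametric spatial memory more concrete, here is a minimal toy sketch in Python. It is not the authors' implementation: the class `SpatialMemory`, the function `transition`, the k-nearest-neighbour read rule, and the additive-displacement dynamics are all assumptions chosen for illustration. The latent codes stand in for what a Variational Auto-Encoder might produce.

```python
import numpy as np


class SpatialMemory:
    """Toy non-parametric memory (hypothetical): stores (position, code) pairs."""

    def __init__(self, k=3):
        self.k = k
        self.positions = []  # low-dimensional agent states
        self.codes = []      # latent codes, e.g. from a VAE encoder (assumed)

    def write(self, position, code):
        self.positions.append(np.asarray(position, dtype=float))
        self.codes.append(np.asarray(code, dtype=float))

    def read(self, query):
        """Inverse-distance weighted average of the k nearest stored codes."""
        pos = np.stack(self.positions)
        codes = np.stack(self.codes)
        dist = np.linalg.norm(pos - np.asarray(query, dtype=float), axis=1)
        nearest = np.argsort(dist)[: self.k]
        weights = 1.0 / (dist[nearest] + 1e-6)
        weights /= weights.sum()
        return weights @ codes[nearest]


def transition(state, action, noise_std=0.0, rng=None):
    """Assumed prior dynamics: the action is a displacement plus Gaussian noise."""
    rng = rng or np.random.default_rng(0)
    state = np.asarray(state, dtype=float)
    noise = noise_std * rng.standard_normal(state.shape)
    return state + np.asarray(action, dtype=float) + noise


# Usage: store codes at two positions, move the agent, then query the memory.
memory = SpatialMemory(k=1)
memory.write([0.0, 0.0], [1.0, 0.0])
memory.write([1.0, 0.0], [0.0, 1.0])
state = transition([0.0, 0.0], [1.0, 0.0])  # one unit to the right, no noise
code = memory.read(state)                   # code stored nearest to the new position
```

In this sketch the memory grows with the number of visited locations rather than being encoded in fixed network weights, which is the sense in which it is non-parametric; a decoder would then map the retrieved code back to a predicted observation.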

research
06/08/2018

Temporal Difference Variational Auto-Encoder

One motivation for learning generative models of environments is to use ...
research
11/09/2021

Risk Sensitive Model-Based Reinforcement Learning using Uncertainty Guided Planning

Identifying uncertainty and taking mitigating actions is crucial for saf...
research
06/22/2019

A neurally plausible model learns successor representations in partially observable environments

Animals need to devise strategies to maximize returns while interacting ...
research
12/07/2021

Information is Power: Intrinsic Control via Information Capture

Humans and animals explore their environment and acquire useful skills e...
research
06/17/2019

Visual Navigation by Generating Next Expected Observations

We propose a novel approach to visual navigation in unknown environments...
research
06/08/2021

Vector Quantized Models for Planning

Recent developments in the field of model-based RL have proven successfu...
research
01/16/2014

Learning to Make Predictions In Partially Observable Environments Without a Generative Model

When faced with the problem of learning a model of a high-dimensional en...
