Recurrent Environment Simulators

04/07/2017
by Silvia Chiappa, et al.

Models that can simulate how environments change in response to actions can be used by agents to plan and act efficiently. We improve on previous environment simulators from high-dimensional pixel observations by introducing recurrent neural networks that are able to make temporally and spatially coherent predictions for hundreds of time-steps into the future. We present an in-depth analysis of the factors affecting performance, providing the most extensive attempt to advance the understanding of the properties of these models. We address the issue of computational inefficiency with a model that does not need to generate a high-dimensional image at each time-step. We show that our approach can be used to improve exploration and is adaptable to many diverse environments, namely 10 Atari games, a 3D car racing environment, and complex 3D mazes.
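To make the idea of an action-conditioned recurrent simulator concrete, the following is a minimal sketch only, not the architecture from the paper. It assumes a toy vector observation in place of pixels, untrained random weights, and a plain tanh recurrence. The class `ToySimulator`, its weight names, and the `decode_every` parameter are all hypothetical. What it illustrates is that the state can be advanced from the previous state and the action alone, so an observation only needs to be generated when one is actually wanted.

```python
import numpy as np


class ToySimulator:
    """Toy action-conditioned recurrent simulator (illustrative assumption,
    not the paper's model). Weights are random and untrained."""

    def __init__(self, obs_dim, num_actions, state_dim, seed=0):
        rng = np.random.default_rng(seed)
        scale = 0.1
        self.num_actions = num_actions
        self.W_enc = rng.normal(0.0, scale, (state_dim, obs_dim))      # observation -> state
        self.W_state = rng.normal(0.0, scale, (state_dim, state_dim))  # state -> next state
        self.W_act = rng.normal(0.0, scale, (state_dim, num_actions))  # action -> next state
        self.W_dec = rng.normal(0.0, scale, (obs_dim, state_dim))      # state -> observation

    def encode(self, obs):
        """Map an initial observation to a hidden state."""
        return np.tanh(self.W_enc @ obs)

    def step(self, state, action):
        """Advance the hidden state using only the previous state and the action."""
        one_hot = np.zeros(self.num_actions)
        one_hot[action] = 1.0
        return np.tanh(self.W_state @ state + self.W_act @ one_hot)

    def decode(self, state):
        """Produce a predicted observation from the hidden state."""
        return self.W_dec @ state

    def rollout(self, obs, actions, decode_every=None):
        """Simulate a sequence of actions, decoding only every `decode_every` steps."""
        state = self.encode(obs)
        frames = []
        for t, action in enumerate(actions, start=1):
            state = self.step(state, action)
            if decode_every and t % decode_every == 0:
                frames.append(self.decode(state))
        return state, frames


# Usage: simulate 100 steps, generating an observation only every 10th step.
sim = ToySimulator(obs_dim=16, num_actions=4, state_dim=8)
initial_obs = np.zeros(16)
actions = [0, 1, 2, 3] * 25
final_state, frames = sim.rollout(initial_obs, actions, decode_every=10)
```

Passing `decode_every=None` skips decoding entirely, which is the cheapest case when only the final state is of interest.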


