Learning Accurate Extended-Horizon Predictions of High Dimensional Trajectories

01/12/2019
by   Brian Gaudet, et al.
0

We present a novel predictive model architecture based on the principles of predictive coding that enables open loop prediction of future observations over extended horizons. There are two key innovations. First, whereas current methods typically learn to make long-horizon open-loop predictions using a multi-step cost function, we instead run the model open loop in the forward pass during training. Second, current predictive coding models initialize the representation layer's hidden state to a constant value at the start of an episode, and consequently typically require multiple steps of interaction with the environment before the model begins to produce accurate predictions. Instead, we learn a mapping from the first observation in an episode to the hidden state, allowing the trained model to immediately produce accurate predictions. We compare the performance of our architecture to a standard predictive coding model and demonstrate the ability of the model to make accurate long horizon open-loop predictions of simulated Doppler radar altimeter readings during a six degree of freedom Mars landing. Finally, we demonstrate a 2X reduction in sample complexity by using the model to implement a Dyna style algorithm to accelerate policy learning with proximal policy optimization.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/12/2009

Closing the Learning-Planning Loop with Predictive State Representations

A central problem in artificial intelligence is that of planning to maxi...
research
12/14/2021

Learning to track environment state via predictive autoencoding

This work introduces a neural architecture for learning forward models o...
research
02/08/2015

From Pixels to Torques: Policy Learning with Deep Dynamical Models

Data-efficient learning in continuous state-action spaces using very hig...
research
06/05/2018

The Effect of Planning Shape on Dyna-style Planning in High-dimensional State Spaces

Dyna is an architecture for reinforcement learning agents that interleav...
research
04/26/2022

A Gaussian Process Model for Opponent Prediction in Autonomous Racing

In head-to-head racing, performing tightly constrained, but highly rewar...
research
12/22/2022

Predictive Coding Based Multiscale Network with Encoder-Decoder LSTM for Video Prediction

We are introducing a multi-scale predictive model for video prediction h...
research
07/11/2019

Adaptive Thompson Sampling Stacks for Memory Bounded Open-Loop Planning

We propose Stable Yet Memory Bounded Open-Loop (SYMBOL) planning, a gene...

Please sign up or login with your details

Forgot password? Click here to reset