Modeling sequential data using higher-order relational features and predictive training

02/10/2014
by   Vincent Michalski, et al.
0

Bi-linear feature learning models, like the gated autoencoder, were proposed as a way to model relationships between frames in a video. By minimizing reconstruction error of one frame, given the previous frame, these models learn "mapping units" that encode the transformations inherent in a sequence, and thereby learn to encode motion. In this work we extend bi-linear models by introducing "higher-order mapping units" that allow us to encode transformations between frames and transformations between transformations. We show that this makes it possible to encode temporal structure that is more complex and longer-range than the structure captured within standard bi-linear models. We also show that a natural way to train the model is by replacing the commonly used reconstruction objective with a prediction objective which forces the model to correctly predict the evolution of the input multiple steps into the future. Learning can be achieved by back-propagating the multi-step prediction through time. We test the model on various temporal prediction tasks, and show that higher-order mappings and predictive training both yield a significant improvement over bi-linear models in terms of prediction accuracy.

READ FULL TEXT

page 6

page 7

page 8

research
08/15/2019

HONEM: Network Embedding Using Higher-Order Patterns in Sequential Data

Representation learning offers a powerful alternative to the oft painsta...
research
10/27/2021

Taylor Swift: Taylor Driven Temporal Modeling for Swift Future Frame Prediction

While recurrent neural networks (RNNs) demonstrate outstanding capabilit...
research
08/15/2022

Pyramidal Predictive Network: A Model for Visual-frame Prediction Based on Predictive Coding Theory

Visual-frame prediction is a pixel-dense prediction task that infers fut...
research
06/13/2013

Learning to encode motion using spatio-temporal synchrony

We consider the task of learning to extract motion from videos. To this ...
research
01/29/2017

Transformation-Based Models of Video Sequences

In this work we propose a simple unsupervised approach for next frame pr...
research
09/28/2016

Stabilizing Linear Prediction Models using Autoencoder

To date, the instability of prognostic predictors in a sparse high dimen...
research
11/09/2022

Trackerless freehand ultrasound with sequence modelling and auxiliary transformation over past and future frames

Three-dimensional (3D) freehand ultrasound (US) reconstruction without a...

Please sign up or login with your details

Forgot password? Click here to reset