Folded Recurrent Neural Networks for Future Video Prediction

12/01/2017
by   Marc Oliu, et al.
0

Future video prediction is an ill-posed Computer Vision problem that recently received much attention. Its main challenges are the high variability in video content, the propagation of errors through time, and the non-specificity of the future frames: given a sequence of past frames there is a continuous distribution of possible futures. This work introduces bijective Gated Recurrent Units, a double mapping between the input and output of a GRU layer. This allows for recurrent auto-encoders with state sharing between encoder and decoder, stratifying the sequence representation and helping to prevent capacity problems. We show how with this topology only the encoder or decoder needs to be applied for input encoding and prediction, respectively. This reduces the computational cost and avoids re-encoding the predictions when generating a sequence of frames, mitigating the propagation of errors. Furthermore, it is possible to remove layers from an already trained model, giving an insight to the role performed by each layer and making the model more explainable. We evaluate our approach on three video datasets, outperforming state of the art prediction results on MMNIST and UCF101, and obtaining competitive results on KTH with 2 and 3 times less memory usage and computational cost than the best scored approach.

READ FULL TEXT

page 6

page 7

page 8

page 9

research
11/28/2016

Hierarchical Boundary-Aware Neural Encoder for Video Captioning

The use of Recurrent Neural Networks for video captioning has recently g...
research
12/06/2016

Video Ladder Networks

We present the Video Ladder Network (VLN) for efficiently generating fut...
research
08/11/2018

Pervasive Attention: 2D Convolutional Neural Networks for Sequence-to-Sequence Prediction

Current state-of-the-art machine translation systems are based on encode...
research
04/12/2016

Video Description using Bidirectional Recurrent Neural Networks

Although traditionally used in the machine translation field, the encode...
research
05/23/2023

FlowChroma – A Deep Recurrent Neural Network for Video Colorization

We develop an automated video colorization framework that minimizes the ...
research
05/03/2019

Temporal Deformable Convolutional Encoder-Decoder Networks for Video Captioning

It is well believed that video captioning is a fundamental but challengi...
research
07/16/2017

RED: Reinforced Encoder-Decoder Networks for Action Anticipation

Action anticipation aims to detect an action before it happens. Many rea...

Please sign up or login with your details

Forgot password? Click here to reset