Improved Conditional VRNNs for Video Prediction

04/27/2019
by   Lluis Castrejon, et al.

Predicting future frames of a video sequence is a challenging generative modeling task. Promising approaches include probabilistic latent variable models such as the Variational Auto-Encoder (VAE). While VAEs can handle uncertainty and model multiple possible future outcomes, they tend to produce blurry predictions. In this work we argue that this is a sign of underfitting. To address this issue, we propose to increase the expressiveness of the latent distributions and to use higher-capacity likelihood models. Our approach relies on a hierarchy of latent variables, which defines a family of flexible prior and posterior distributions in order to better model the probability of future sequences. We validate our proposal through a series of ablation experiments and compare our approach to current state-of-the-art latent variable models. Our method performs favorably under several metrics on three different datasets.
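To make the role of the latent hierarchy more concrete, the following is a minimal sketch of how the KL regularization term of a VAE-style objective is commonly accumulated across several latent levels. It is not taken from the paper: the diagonal Gaussian form, the function names `gaussian_kl` and `hierarchical_kl`, and the assumption that each level's prior and posterior parameters have already been computed are all illustrative choices.

```python
import math


def gaussian_kl(mu_q, logvar_q, mu_p, logvar_p):
    """KL(q || p) between two diagonal Gaussians, summed over dimensions.

    Each argument is a sequence of floats of equal length; variances are
    passed as log-variances (an assumption made for this sketch).
    """
    total = 0.0
    for mq, lq, mp, lp in zip(mu_q, logvar_q, mu_p, logvar_p):
        total += 0.5 * (
            lp - lq + (math.exp(lq) + (mq - mp) ** 2) / math.exp(lp) - 1.0
        )
    return total


def hierarchical_kl(posteriors, priors):
    """Sum the per-level KL terms of a latent hierarchy.

    `posteriors` and `priors` are lists with one (mu, logvar) pair per
    level. In a real hierarchical model the parameters of each level
    would depend on samples from the other levels; here they are
    assumed to be given.
    """
    return sum(gaussian_kl(*q, *p) for q, p in zip(posteriors, priors))


# Two hypothetical latent levels, each with a single dimension.
posteriors = [([1.0], [0.0]), ([0.0], [0.0])]
priors = [([0.0], [0.0]), ([0.0], [0.0])]
kl_total = hierarchical_kl(posteriors, priors)
```

In this toy setting only the first level contributes, since the second level's posterior matches its prior exactly. A more expressive prior can sit closer to the posterior at each level, which lowers this term without forcing the posterior to discard information.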
