Stochastic Variational Video Prediction

10/30/2017
by   Mohammad Babaeizadeh, et al.
0

Predicting the future in real-world settings, particularly from raw sensory observations such as images, is exceptionally challenging. Real-world events can be stochastic and unpredictable, and the high dimensionality and complexity of natural images requires the predictive model to build an intricate understanding of the natural world. Many existing methods tackle this problem by making simplifying assumptions about the environment. One common assumption is that the outcome is deterministic and there is only one plausible future. This can lead to low-quality predictions in real-world settings with stochastic dynamics. In this paper, we develop a stochastic variational video prediction (SV2P) method that predicts a different possible future for each sample of its latent variables. To the best of our knowledge, our model is the first to provide effective stochastic multi-frame prediction for real-world video. We demonstrate the capability of the proposed method in predicting detailed future frames of videos on multiple real-world datasets, both action-free and action-conditioned. We find that our proposed method produces substantially improved video predictions when compared to the same model without stochasticity, and to other stochastic video prediction methods. Our SV2P implementation will be open sourced upon publication.

READ FULL TEXT

page 2

page 9

page 10

research
05/23/2016

Unsupervised Learning for Physical Interaction through Video Prediction

A core challenge for an agent learning to interact with the world is to ...
research
03/04/2019

VideoFlow: A Flow-Based Generative Model for Video

Generative models that can model and predict sequences of future events ...
research
10/06/2021

A Hierarchical Variational Neural Uncertainty Model for Stochastic Video Prediction

Predicting the future frames of a video is a challenging task, in part d...
research
11/12/2019

Experience-Embedded Visual Foresight

Visual foresight gives an agent a window into the future, which it can u...
research
04/04/2018

Stochastic Adversarial Video Prediction

Being able to predict what may happen in the future requires an in-depth...
research
05/05/2017

Motion Prediction Under Multimodality with Conditional Stochastic Networks

Given a visual history, multiple future outcomes for a video scene are e...
research
09/19/2022

T3VIP: Transformation-based 3D Video Prediction

For autonomous skill acquisition, robots have to learn about the physica...

Please sign up or login with your details

Forgot password? Click here to reset