Stochastic Adversarial Video Prediction

04/04/2018
by   Alex X. Lee, et al.
0

Being able to predict what may happen in the future requires an in-depth understanding of the physical and causal rules that govern the world. A model that is able to do so has a number of appealing applications, from robotic planning to representation learning. However, learning to predict raw future observations, such as frames in a video, is exceedingly challenging -- the ambiguous nature of the problem can cause a naively designed model to average together possible futures into a single, blurry prediction. Recently, this has been addressed by two distinct approaches: (a) latent variational variable models that explicitly model underlying stochasticity and (b) adversarially-trained models that aim to produce naturalistic images. However, a standard latent variable model can struggle to produce realistic results, and a standard adversarially-trained model underutilizes latent variables and fails to produce diverse predictions. We show that these distinct methods are in fact complementary. Combining the two produces predictions that look more realistic to human raters and better cover the range of possible futures. Our method outperforms prior and concurrent work in these aspects.

READ FULL TEXT

page 2

page 7

page 11

page 12

page 13

page 24

research
08/31/2020

Future Frame Prediction of a Video Sequence

Predicting future frames of a video sequence has been a problem of high ...
research
04/27/2019

Improved Conditional VRNNs for Video Prediction

Predicting future frames for a video sequence is a challenging generativ...
research
10/30/2017

Stochastic Variational Video Prediction

Predicting the future in real-world settings, particularly from raw sens...
research
06/25/2016

An Uncertain Future: Forecasting from Static Images using Variational Autoencoders

In a given scene, humans can often easily predict a set of immediate fut...
research
12/18/2018

Learning Direct Optimization for Scene Understanding

We introduce a Learning Direct Optimization method for the refinement of...
research
03/06/2021

Greedy Hierarchical Variational Autoencoders for Large-Scale Video Prediction

A video prediction model that generalizes to diverse scenes would enable...
research
11/14/2017

Prediction Under Uncertainty with Error-Encoding Networks

In this work we introduce a new framework for performing temporal predic...

Please sign up or login with your details

Forgot password? Click here to reset