Compositional Video Prediction

08/22/2019
by   Yufei Ye, et al.
3

We present an approach for pixel-level future prediction given an input image of a scene. We observe that a scene is comprised of distinct entities that undergo motion and present an approach that operationalizes this insight. We implicitly predict future states of independent entities while reasoning about their interactions, and compose future video frames using these predicted states. We overcome the inherent multi-modality of the task using a global trajectory-level latent random variable, and show that this allows us to sample diverse and plausible futures. We empirically validate our approach against alternate representations and ways of incorporating multi-modality. We examine two datasets, one comprising of stacked objects that may fall, and the other containing videos of humans performing activities in a gym, and show that our approach allows realistic stochastic video prediction across these diverse settings. See https://judyye.github.io/CVP/ for video predictions.

READ FULL TEXT

page 1

page 3

page 6

page 7

page 8

research
04/01/2020

Future Video Synthesis with Object Motion Prediction

We present an approach to predict future video frames given a sequence o...
research
01/17/2019

Disentangling Video with Independent Prediction

We propose an unsupervised variational model for disentangling video int...
research
07/09/2021

Diverse Video Generation using a Gaussian Process Trigger

Generating future frames given a few context (or past) frames is a chall...
research
06/21/2021

Understanding Object Dynamics for Interactive Image-to-Video Synthesis

What would be the effect of locally poking a static scene? We present an...
research
07/03/2020

Video Prediction via Example Guidance

In video prediction tasks, one major challenge is to capture the multi-m...
research
10/12/2021

Fourier-based Video Prediction through Relational Object Motion

The ability to predict future outcomes conditioned on observed video fra...
research
06/25/2016

An Uncertain Future: Forecasting from Static Images using Variational Autoencoders

In a given scene, humans can often easily predict a set of immediate fut...

Please sign up or login with your details

Forgot password? Click here to reset