Consistent Generative Query Networks

07/05/2018
by   Ananya Kumar, et al.
6

Stochastic video prediction is usually framed as an extrapolation problem where the goal is to sample a sequence of consecutive future image frames conditioned on a sequence of observed past frames. For the most part, algorithms for this task generate future video frames sequentially in an autoregressive fashion, which is slow and requires the input and output to be consecutive. We introduce a model that overcomes these drawbacks -- it learns to generate a global latent representation from an arbitrary set of frames within a video. This representation can then be used to simultaneously and efficiently sample any number of temporally consistent frames at arbitrary time-points in the video. We apply our model to synthetic video prediction tasks and achieve results that are comparable to state-of-the-art video prediction models. In addition, we demonstrate the flexibility of our model by applying it to 3D scene reconstruction where we condition on location instead of time. To the best of our knowledge, our model is the first to provide flexible and coherent prediction on stochastic video datasets, as well as consistent 3D scene samples. Please check the project website https://bit.ly/2jX7Vyu to view scene reconstructions and videos produced by our model.

READ FULL TEXT

page 6

page 8

page 14

research
05/01/2020

A Naturalness Evaluation Database for Video Prediction Models

The study of video prediction models is believed to be a fundamental app...
research
09/07/2016

Component-Based Distributed Framework for Coherent and Real-Time Video Dehazing

Traditional dehazing techniques, as a well studied topic in image proces...
research
04/11/2019

KeyIn: Discovering Subgoal Structure with Keyframe-based Video Prediction

Real-world image sequences can often be naturally decomposed into a smal...
research
05/19/2022

Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation

Video prediction is a challenging task. The quality of video frames from...
research
02/25/2019

Generative Models for Low-Rank Video Representation and Reconstruction

Finding compact representation of videos is an essential component in al...
research
05/23/2022

Flexible Diffusion Modeling of Long Videos

We present a framework for video modeling based on denoising diffusion p...
research
08/23/2018

Time-Agnostic Prediction: Predicting Predictable Video Frames

Prediction is arguably one of the most basic functions of an intelligent...

Please sign up or login with your details

Forgot password? Click here to reset