SDCNet: Video Prediction Using Spatially-Displaced Convolution

11/02/2018
by   Fitsum A. Reda, et al.
6

We present an approach for high-resolution video frame prediction by conditioning on both past frames and past optical flows. Previous approaches rely on resampling past frames, guided by a learned future optical flow, or on direct generation of pixels. Resampling based on flow is insufficient because it cannot deal with disocclusions. Generative models currently lead to blurry results. Recent approaches synthesis a pixel by convolving input patches with a predicted kernel. However, their memory requirement increases with kernel size. Here, we spatially-displaced convolution (SDC) module for video frame prediction. We learn a motion vector and a kernel for each pixel and synthesize a pixel by applying the kernel at a displaced location in the source image, defined by the predicted motion vector. Our approach inherits the merits of both vector-based and kernel-based approaches, while ameliorating their respective disadvantages. We train our model on 428K unlabelled 1080p video game frames. Our approach produces state-of-the-art results, achieving an SSIM score of 0.904 on high-definition YouTube-8M videos, 0.918 on Caltech Pedestrian videos. Our model handles large motion effectively and synthesizes crisp frames with consistent motion.

READ FULL TEXT

page 1

page 9

page 11

page 12

page 13

page 14

research
03/22/2017

Video Frame Interpolation via Adaptive Convolution

Video frame interpolation typically involves two steps: motion estimatio...
research
08/01/2017

Dual Motion GAN for Future-Flow Embedded Video Prediction

Future frame prediction in videos is a promising avenue for unsupervised...
research
11/26/2022

Randomized Conditional Flow Matching for Video Prediction

We introduce a novel generative model for video prediction based on late...
research
05/01/2020

A Naturalness Evaluation Database for Video Prediction Models

The study of video prediction models is believed to be a fundamental app...
research
05/26/2020

End-to-end Optimized Video Compression with MV-Residual Prediction

We present an end-to-end trainable framework for P-frame compression in ...
research
03/06/2023

Polar Prediction of Natural Videos

Observer motion and continuous deformations of objects and surfaces imbu...
research
02/25/2023

RipViz: Finding Rip Currents by Learning Pathline Behavior

We present a hybrid machine learning and flow analysis feature detection...

Please sign up or login with your details

Forgot password? Click here to reset