WALDO: Future Video Synthesis using Object Layer Decomposition and Parametric Flow Prediction

11/25/2022
by Guillaume Le Moing, et al.

This paper presents WALDO (WArping Layer-Decomposed Objects), a novel approach to the prediction of future video frames from past ones. Individual images are decomposed into multiple layers combining object masks and a small set of control points. The layer structure is shared across all frames in each video to build dense inter-frame connections. Complex scene motions are modeled by combining parametric geometric transformations associated with individual layers. Video synthesis is broken down into discovering the layers associated with past frames; predicting the corresponding transformations for upcoming frames and warping the associated object regions accordingly; and filling in the remaining image parts. Extensive experiments on the Cityscapes (resp. KITTI) dataset show that WALDO significantly outperforms prior works with, e.g., 3, 27, and 51 LPIPS and FVD metrics. Code, pretrained models, and video samples synthesized by our approach can be found on the project webpage https://16lemoing.github.io/waldo.
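
The warp-and-fill idea in the abstract can be illustrated with a toy example. The sketch below is not the authors' implementation: it assumes one affine transform per layer (a simplification standing in for the control-point parameterization the abstract mentions), nearest-neighbor sampling, and tiny grayscale images stored as nested lists. The names `warp_layer` and `synthesize` are hypothetical. Each layer's mask and pixels are warped by that layer's transform, the layers are composited in order, and the pixels covered by no layer are returned as a hole map, which is the region a separate inpainting step would have to fill.

```python
from typing import List, Tuple

Grid = List[List[float]]
# Forward affine map (assumed form): x' = a*x + b*y + tx, y' = c*x + d*y + ty
Affine = Tuple[float, float, float, float, float, float]  # (a, b, tx, c, d, ty)


def warp_layer(values: Grid, affine: Affine, fill: float = 0.0) -> Grid:
    """Backward-warp a 2D grid with nearest-neighbor sampling."""
    a, b, tx, c, d, ty = affine
    det = a * d - b * c
    if abs(det) < 1e-12:
        raise ValueError("affine transform is not invertible")
    h, w = len(values), len(values[0])
    out = [[fill] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            # Invert the forward map to find the source pixel of output (x, y).
            dx, dy = x - tx, y - ty
            sx = (d * dx - b * dy) / det
            sy = (-c * dx + a * dy) / det
            ix, iy = round(sx), round(sy)
            if 0 <= ix < w and 0 <= iy < h:
                out[y][x] = values[iy][ix]
    return out


def synthesize(image: Grid, masks: List[Grid],
               affines: List[Affine]) -> Tuple[Grid, Grid]:
    """Warp every layer, composite them (later layers on top), and
    return (frame, holes), where holes marks pixels no layer covers."""
    h, w = len(image), len(image[0])
    frame = [[0.0] * w for _ in range(h)]
    covered = [[0.0] * w for _ in range(h)]
    for mask, affine in zip(masks, affines):
        warped_mask = warp_layer(mask, affine)
        warped_image = warp_layer(image, affine)
        for y in range(h):
            for x in range(w):
                m = warped_mask[y][x]
                frame[y][x] = m * warped_image[y][x] + (1.0 - m) * frame[y][x]
                covered[y][x] = m + (1.0 - m) * covered[y][x]
    holes = [[1.0 - c for c in row] for row in covered]
    return frame, holes


# Toy scene: a static background layer and a one-pixel object moving right.
image = [[1.0, 2.0, 3.0, 4.0],
         [5.0, 6.0, 7.0, 8.0],
         [9.0, 10.0, 11.0, 12.0]]
object_mask = [[0.0, 0.0, 0.0, 0.0],
               [0.0, 1.0, 0.0, 0.0],
               [0.0, 0.0, 0.0, 0.0]]
background_mask = [[1.0 - m for m in row] for row in object_mask]
identity = (1.0, 0.0, 0.0, 0.0, 1.0, 0.0)
shift_right = (1.0, 0.0, 1.0, 0.0, 1.0, 0.0)

frame, holes = synthesize(image, [background_mask, object_mask],
                          [identity, shift_right])
```

In this toy scene the object pixel moves one column to the right and overwrites the background there, while its old position is left uncovered and flagged in `holes`. A learned model would predict the per-layer transforms for future frames rather than take them as given.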

research
04/20/2021

Learning Semantic-Aware Dynamics for Video Prediction

We propose an architecture and training scheme to predict video frames b...
research
04/01/2020

Future Video Synthesis with Object Motion Prediction

We present an approach to predict future video frames given a sequence o...
research
08/20/2021

Out-of-boundary View Synthesis Towards Full-Frame Video Stabilization

Warping-based video stabilizers smooth camera trajectory by constraining...
research
06/13/2022

Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object Tokens

Recent action recognition models have achieved impressive results by int...
research
03/19/2020

Photo-Realistic Video Prediction on Natural Videos of Largely Changing Frames

Recent advances in deep learning have significantly improved performance...
research
05/05/2022

Parametric Reshaping of Portraits in Videos

Sharing short personalized videos to various social media networks has b...
research
07/06/2021

iPOKE: Poking a Still Image for Controlled Stochastic Video Synthesis

How would a static scene react to a local poke? What are the effects on ...
