Photo-Realistic Video Prediction on Natural Videos of Largely Changing Frames

03/19/2020
by   Osamu Shouno, et al.
24

Recent advances in deep learning have significantly improved performance of video prediction. However, state-of-the-art methods still suffer from blurriness and distortions in their future predictions, especially when there are large motions between frames. To address these issues, we propose a deep residual network with the hierarchical architecture where each layer makes a prediction of future state at different spatial resolution, and these predictions of different layers are merged via top-down connections to generate future frames. We trained our model with adversarial and perceptual loss functions, and evaluated it on a natural video dataset captured by car-mounted cameras. Our model quantitatively outperforms state-of-the-art baselines in future frame prediction on video sequences of both largely and slightly changing frames. Furthermore, our model generates future frames with finer details and textures that are perceptually more realistic than the baselines, especially under fast camera motions.

READ FULL TEXT

page 2

page 10

page 12

page 14

research
02/15/2022

Beyond Natural Motion: Exploring Discontinuity for Video Frame Interpolation

Video interpolation is the task that synthesizes the intermediate frame ...
research
08/01/2017

Dual Motion GAN for Future-Flow Embedded Video Prediction

Future frame prediction in videos is a promising avenue for unsupervised...
research
09/22/2017

Learning to Generate Time-Lapse Videos Using Multi-Stage Dynamic Generative Adversarial Networks

Taking a photo outside, can we predict the immediate future, e.g., how w...
research
03/20/2018

A Temporally-Aware Interpolation Network for Video Frame Inpainting

We propose the first deep learning solution to video frame inpainting, a...
research
12/20/2014

Video (language) modeling: a baseline for generative models of natural videos

We propose a strong baseline model for unsupervised feature learning usi...
research
11/25/2022

WALDO: Future Video Synthesis using Object Layer Decomposition and Parametric Flow Prediction

This paper presents WALDO (WArping Layer-Decomposed Objects), a novel ap...
research
11/13/2020

Adaptive Future Frame Prediction with Ensemble Network

Future frame prediction in videos is a challenging problem because video...

Please sign up or login with your details

Forgot password? Click here to reset