MMVP: Motion-Matrix-based Video Prediction

08/30/2023
by   Yiqi Zhong, et al.
0

A central challenge of video prediction lies where the system has to reason the objects' future motions from image frames while simultaneously maintaining the consistency of their appearances across frames. This work introduces an end-to-end trainable two-stream video prediction framework, Motion-Matrix-based Video Prediction (MMVP), to tackle this challenge. Unlike previous methods that usually handle motion prediction and appearance maintenance within the same set of modules, MMVP decouples motion and appearance information by constructing appearance-agnostic motion matrices. The motion matrices represent the temporal similarity of each and every pair of feature patches in the input frames, and are the sole input of the motion prediction module in MMVP. This design improves video prediction in both accuracy and efficiency, and reduces the model size. Results of extensive experiments demonstrate that MMVP outperforms state-of-the-art systems on public data sets by non-negligible large margins (about 1 db in PSNR, UCF Sports) in significantly smaller model sizes (84 size or smaller).

READ FULL TEXT

page 7

page 8

research
08/05/2021

SLAMP: Stochastic Latent Appearance and Motion Prediction

Motion is an important cue for video prediction and often utilized by se...
research
12/11/2019

G^3AN: This video does not exist. Disentangling motion and appearance for video generation

Creating realistic human videos introduces the challenge of being able t...
research
01/01/2017

Video-based Person Re-identification with Accumulative Motion Context

Video based person re-identification plays a central role in realistic s...
research
07/07/2018

Video Prediction with Appearance and Motion Conditions

Video prediction aims to generate realistic future frames by learning dy...
research
10/06/2022

Text-driven Video Prediction

Current video generation models usually convert signals indicating appea...
research
07/02/2020

Understanding Road Layout from Videos as a Whole

In this paper, we address the problem of inferring the layout of complex...
research
06/15/2022

VCT: A Video Compression Transformer

We show how transformers can be used to vastly simplify neural video com...

Please sign up or login with your details

Forgot password? Click here to reset