Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation

03/01/2023
by   Guozhen Zhang, et al.
0

Effectively extracting inter-frame motion and appearance information is important for video frame interpolation (VFI). Previous works either extract both types of information in a mixed way or elaborate separate modules for each type of information, which lead to representation ambiguity and low efficiency. In this paper, we propose a novel module to explicitly extract motion and appearance information via a unifying operation. Specifically, we rethink the information process in inter-frame attention and reuse its attention map for both appearance feature enhancement and motion information extraction. Furthermore, for efficient VFI, our proposed module could be seamlessly integrated into a hybrid CNN and Transformer architecture. This hybrid pipeline can alleviate the computational complexity of inter-frame attention as well as preserve detailed low-level structure information. Experimental results demonstrate that, for both fixed- and arbitrary-timestep interpolation, our method achieves state-of-the-art performance on various datasets. Meanwhile, our approach enjoys a lighter computation overhead over models with close performance. The source code and models are available at https://github.com/MCG-NJU/EMA-VFI.

READ FULL TEXT

page 7

page 8

research
04/05/2023

BiFormer: Learning Bilateral Motion Estimation via Bilateral Transformer for 4K Video Frame Interpolation

A novel 4K video frame interpolator based on bilateral transformer (BiFo...
research
07/31/2023

Uncertainty-Guided Spatial Pruning Architecture for Efficient Frame Interpolation

The video frame interpolation (VFI) model applies the convolution operat...
research
08/13/2023

FastLLVE: Real-Time Low-Light Video Enhancement with Intensity-Aware Lookup Table

Low-Light Video Enhancement (LLVE) has received considerable attention i...
research
03/15/2023

Skinned Motion Retargeting with Residual Perception of Motion Semantics Geometry

A good motion retargeting cannot be reached without reasonable considera...
research
02/27/2020

Blurry Video Frame Interpolation

Existing works reduce motion blur and up-convert frame rate through two ...
research
04/16/2023

CAT-NeRF: Constancy-Aware Tx^2Former for Dynamic Body Modeling

This paper addresses the problem of human rendering in the video with te...
research
01/18/2023

Gated-ViGAT: Efficient Bottom-Up Event Recognition and Explanation Using a New Frame Selection Policy and Gating Mechanism

In this paper, Gated-ViGAT, an efficient approach for video event recogn...

Please sign up or login with your details

Forgot password? Click here to reset