Efficient Convolution and Transformer-Based Network for Video Frame Interpolation

07/12/2023
by Issa Khalifeh, et al.

Video frame interpolation is an increasingly important research task with several key industrial applications in the video coding, broadcast and production sectors. Recently, transformers have been introduced to the field, resulting in substantial performance gains. However, this comes at the cost of greatly increased memory usage and longer training and inference times. In this paper, a novel method integrating a transformer encoder and convolutional features is proposed. This network reduces the memory burden by close to 50% and runs up to four times faster at inference time than existing transformer-based interpolation methods. A dual-encoder architecture is introduced which combines the strength of convolutions in modelling local correlations with that of the transformer in modelling long-range dependencies. Quantitative evaluations are conducted on various benchmarks with complex motion to showcase the robustness of the proposed method, which achieves competitive performance compared to state-of-the-art interpolation networks.
