Cross-Attention Transformer for Video Interpolation

07/08/2022
by   Hannah Halin Kim, et al.
1

We propose TAIN (Transformers and Attention for video INterpolation), a residual neural network for video interpolation, which aims to interpolate an intermediate frame given two consecutive image frames around it. We first present a novel visual transformer module, named Cross-Similarity (CS), to globally aggregate input image features with similar appearance as those of the predicted interpolated frame. These CS features are then used to refine the interpolated prediction. To account for occlusions in the CS features, we propose an Image Attention (IA) module to allow the network to focus on CS features from one frame over those of the other. Additionally, we augment our training dataset with an occluder patch that moves across frames to improve the network's robustness to occlusions and large motion. Because existing methods yield smooth predictions especially near MBs, we use an additional training loss based on image gradient to yield sharper predictions. TAIN outperforms existing methods that do not require flow estimation and performs comparably to flow-based methods while being computationally efficient in terms of inference time on Vimeo90k, UCF101, and SNU-FILM benchmarks.

READ FULL TEXT

page 7

page 8

page 12

page 13

research
11/30/2017

Super SloMo: High Quality Estimation of Multiple Intermediate Frames for Video Interpolation

Given two consecutive frames, video interpolation aims at generating int...
research
05/17/2021

EA-Net: Edge-Aware Network for Flow-based Video Frame Interpolation

Video frame interpolation can up-convert the frame rate and enhance the ...
research
06/24/2023

Boost Video Frame Interpolation via Motion Adaptation

Video frame interpolation (VFI) is a challenging task that aims to gener...
research
11/21/2022

H-VFI: Hierarchical Frame Interpolation for Videos with Large Motions

Capitalizing on the rapid development of neural networks, recent video f...
research
05/15/2022

Video Frame Interpolation with Transformer

Video frame interpolation (VFI), which aims to synthesize intermediate f...
research
11/19/2022

NIO: Lightweight neural operator-based architecture for video frame interpolation

We present, NIO - Neural Interpolation Operator, a lightweight efficient...
research
07/12/2023

Efficient Convolution and Transformer-Based Network for Video Frame Interpolation

Video frame interpolation is an increasingly important research task wit...

Please sign up or login with your details

Forgot password? Click here to reset