PDWN: Pyramid Deformable Warping Network for Video Interpolation

04/04/2021
by   Zhiqi Chen, et al.
0

Video interpolation aims to generate a non-existent intermediate frame given the past and future frames. Many state-of-the-art methods achieve promising results by estimating the optical flow between the known frames and then generating the backward flows between the middle frame and the known frames. However, these methods usually suffer from the inaccuracy of estimated optical flows and require additional models or information to compensate for flow estimation errors. Following the recent development in using deformable convolution (DConv) for video interpolation, we propose a light but effective model, called Pyramid Deformable Warping Network (PDWN). PDWN uses a pyramid structure to generate DConv offsets of the unknown middle frame with respect to the known frames through coarse-to-fine successive refinements. Cost volumes between warped features are calculated at every pyramid level to help the offset inference. At the finest scale, the two warped frames are adaptively blended to generate the middle frame. Lastly, a context enhancement network further enhances the contextual detail of the final output. Ablation studies demonstrate the effectiveness of the coarse-to-fine offset refinement, cost volumes, and DConv. Our method achieves better or on-par accuracy compared to state-of-the-art models on multiple datasets while the number of model parameters and the inference time are substantially less than previous models. Moreover, we present an extension of the proposed framework to use four input frames, which can achieve significant improvement over using only two input frames, with only a slight increase in the model size and inference time.

READ FULL TEXT

page 1

page 7

page 8

page 9

page 10

page 11

research
02/15/2022

Enhancing Deformable Convolution based Video Frame Interpolation with Coarse-to-fine 3D CNN

This paper presents a new deformable convolution-based video frame inter...
research
11/22/2022

Flow Guidance Deformable Compensation Network for Video Frame Interpolation

Motion-based video frame interpolation (VFI) methods have made remarkabl...
research
01/17/2021

Temporal Spatial-Adaptive Interpolation with Deformable Refinement for Electron Microscopic Images

Recently, flow-based methods have achieved promising success in video fr...
research
11/17/2021

Enhanced Correlation Matching based Video Frame Interpolation

We propose a novel DNN based framework called the Enhanced Correlation M...
research
04/19/2023

AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation

We present All-Pairs Multi-Field Transforms (AMT), a new network archite...
research
11/01/2021

Joint Detection of Motion Boundaries and Occlusions

We propose MONet, a convolutional neural network that jointly detects mo...
research
09/05/2023

Hierarchical Masked 3D Diffusion Model for Video Outpainting

Video outpainting aims to adequately complete missing areas at the edges...

Please sign up or login with your details

Forgot password? Click here to reset