Flow-Guided Transformer for Video Inpainting

08/14/2022
by Kaidong Zhang, et al.

We propose a flow-guided transformer that leverages the motion discrepancy exposed by optical flows to guide attention retrieval in the transformer for high-fidelity video inpainting. More specifically, we design a novel flow completion network that completes corrupted flows by exploiting relevant flow features within a local temporal window. With the completed flows, we propagate content across video frames and adopt the flow-guided transformer to synthesize the remaining corrupted regions. We decouple the transformers along the temporal and spatial dimensions, so that the locally relevant completed flows can be easily integrated to guide spatial attention only. Furthermore, we design a flow-reweight module to precisely control the impact of the completed flows on each spatial transformer. For efficiency, we apply a window partition strategy to both the spatial and temporal transformers. In the spatial transformer in particular, we design a dual-perspective spatial MHSA (multi-head self-attention), which integrates global tokens into the window-based attention. Extensive experiments demonstrate the effectiveness of the proposed method both qualitatively and quantitatively. Code is available at https://github.com/hitachinsk/FGT.
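
The content-propagation step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a single neighboring frame, grayscale pixels stored as nested lists, a completed flow given as a per-pixel `(dx, dy)` offset into the neighbor, and nearest-neighbor lookup. The function name `propagate_with_flow` and all parameter names are hypothetical. Pixels that cannot be filled stay masked, which corresponds to the regions the abstract leaves to the transformer.

```python
from typing import List, Tuple

Grid = List[List[float]]                      # grayscale frame, indexed [y][x]
Mask = List[List[bool]]                       # True marks a corrupted pixel
FlowField = List[List[Tuple[float, float]]]   # (dx, dy) offset per pixel


def propagate_with_flow(
    target: Grid,
    target_mask: Mask,
    neighbor: Grid,
    neighbor_mask: Mask,
    flow: FlowField,
) -> Tuple[Grid, Mask]:
    """Fill corrupted target pixels from a neighbor frame along the flow.

    Hypothetical helper for illustration only. Returns a filled copy of
    the target and a mask of the pixels that are still corrupted.
    """
    h, w = len(target), len(target[0])
    filled = [row[:] for row in target]          # do not modify the inputs
    remaining = [row[:] for row in target_mask]

    for y in range(h):
        for x in range(w):
            if not target_mask[y][x]:
                continue                         # pixel is already valid
            dx, dy = flow[y][x]
            # Nearest-neighbor lookup of the source position in the neighbor.
            sx, sy = int(round(x + dx)), int(round(y + dy))
            in_bounds = 0 <= sx < w and 0 <= sy < h
            if in_bounds and not neighbor_mask[sy][sx]:
                filled[y][x] = neighbor[sy][sx]
                remaining[y][x] = False          # successfully propagated
    return filled, remaining
```

In practice a method of this kind would typically use sub-pixel interpolation and several neighboring frames; the nearest-neighbor, single-frame version here only shows the control flow.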

research
01/24/2023

Exploiting Optical Flow Guidance for Transformer-Based Video Inpainting

Transformers have been widely used for video processing owing to the mul...
research
07/20/2020

Learning Joint Spatial-Temporal Transformations for Video Inpainting

High-quality video inpainting that completes missing regions in video fr...
research
09/28/2022

DeViT: Deformed Vision Transformers in Video Inpainting

This paper proposes a novel video inpainting method. We make three main ...
research
08/09/2023

Histogram-guided Video Colorization Structure with Spatial-Temporal Connection

Video colorization, aiming at obtaining colorful and plausible results f...
research
09/07/2023

ProPainter: Improving Propagation and Transformer for Video Inpainting

Flow-based propagation and spatiotemporal Transformer are two mainstream...
research
11/21/2022

FlowLens: Seeing Beyond the FoV via Flow-guided Clip-Recurrent Transformer

Limited by hardware cost and system size, camera's Field-of-View (FoV) i...
research
07/17/2023

Deficiency-Aware Masked Transformer for Video Inpainting

Recent video inpainting methods have made remarkable progress by utilizi...
