Align-and-Attend Network for Globally and Locally Coherent Video Inpainting

05/30/2019
by   Sanghyun Woo, et al.
0

We propose a novel feed-forward network for video inpainting. We use a set of sampled video frames as the reference to take visible contents to fill the hole of a target frame. Our video inpainting network consists of two stages. The first stage is an alignment module that uses computed homographies between the reference frames and the target frame. The visible patches are then aggregated based on the frame similarity to fill in the target holes roughly. The second stage is a non-local attention module that matches the generated patches with known reference patches (in space and time) to refine the previous global alignment stage. Both stages consist of large spatial-temporal window size for the reference and thus enable modeling long-range correlations between distant information and the hole regions. Therefore, even challenging scenes with large or slowly moving holes can be handled, which have been hardly modeled by existing flow-based approach. Our network is also designed with a recurrent propagation stream to encourage temporal consistency in video results. Experiments on video object removal demonstrate that our method inpaints the holes with globally and locally coherent contents.

READ FULL TEXT

page 2

page 8

research
08/23/2019

Onion-Peel Networks for Deep Video Completion

We propose the onion-peel networks for video completion. Given a set of ...
research
06/04/2021

Temporally coherent video anonymization through GAN inpainting

This work tackles the problem of temporally coherent face anonymization ...
research
05/19/2022

Towards Unified Keyframe Propagation Models

Many video editing tasks such as rotoscoping or object removal require t...
research
08/30/2019

Copy-and-Paste Networks for Deep Video Inpainting

We present a novel deep learning based algorithm for video inpainting. V...
research
05/29/2022

Feature-Aligned Video Raindrop Removal with Temporal Constraints

Existing adherent raindrop removal methods focus on the detection of the...
research
03/07/2021

ARVo: Learning All-Range Volumetric Correspondence for Video Deblurring

Video deblurring models exploit consecutive frames to remove blurs from ...

Please sign up or login with your details

Forgot password? Click here to reset