Deep Video Matting via Spatio-Temporal Alignment and Aggregation

04/22/2021
by   Yanan Sun, et al.

Despite the significant progress made by deep learning in natural image matting, there has so far been no representative work on deep learning for video matting, owing to the inherent technical challenges of reasoning in the temporal domain and the lack of large-scale video matting datasets. In this paper, we propose a deep learning-based video matting framework that employs a novel and effective spatio-temporal feature aggregation module (ST-FAM). Because optical flow estimation can be very unreliable within matting regions, ST-FAM is designed to align and aggregate information across different spatial scales and temporal frames within the network decoder. To eliminate frame-by-frame trimap annotations, we also introduce a lightweight interactive trimap propagation network. Our other contribution is a large-scale video matting dataset with ground-truth alpha mattes for quantitative evaluation, together with real-world high-resolution videos with trimaps for qualitative evaluation. Quantitative and qualitative experimental results show that our framework significantly outperforms conventional video matting methods and deep image matting methods applied to video in the presence of multi-frame temporal information.

research
04/28/2019

Spatio-Temporal Filter Adaptive Network for Video Deblurring

Video deblurring is a challenging task due to the spatially variant blur...
research
04/04/2020

Multi-Variate Temporal GAN for Large Scale Video Generation

In this paper, we present a network architecture for video generation th...
research
05/04/2023

ItoV: Efficiently Adapting Deep Learning-based Image Watermarking to Video Watermarking

Robust watermarking tries to conceal information within a cover image/vi...
research
05/13/2022

The Effectiveness of Temporal Dependency in Deepfake Video Detection

Deepfakes are a form of synthetic image generation used to generate fake...
research
07/24/2019

StableNet: Semi-Online, Multi-Scale Deep Video Stabilization

Video stabilization algorithms are of greater importance nowadays with t...
research
06/06/2021

Technical Report: Temporal Aggregate Representations

This technical report extends our work presented in [9] with more experi...
research
05/31/2021

VidFace: A Full-Transformer Solver for Video Face Hallucination with Unaligned Tiny Snapshots

In this paper, we investigate the task of hallucinating an authentic hig...
