One-Trimap Video Matting

07/27/2022
by   Hongje Seong, et al.
0

Recent studies made great progress in video matting by extending the success of trimap-based image matting to the video domain. In this paper, we push this task toward a more practical setting and propose One-Trimap Video Matting network (OTVM) that performs video matting robustly using only one user-annotated trimap. A key of OTVM is the joint modeling of trimap propagation and alpha prediction. Starting from baseline trimap propagation and alpha prediction networks, our OTVM combines the two networks with an alpha-trimap refinement module to facilitate information flow. We also present an end-to-end training strategy to take full advantage of the joint model. Our joint modeling greatly improves the temporal stability of trimap propagation compared to the previous decoupled methods. We evaluate our model on two latest video matting benchmarks, Deep Video Matting and VideoMatting108, and outperform state-of-the-art by significant margins (MSE improvements of 56.4 and 56.7 https://github.com/Hongje/OTVM.

READ FULL TEXT

page 13

page 14

page 28

page 29

page 30

page 31

page 32

page 33

research
03/30/2022

End to End Lip Synchronization with a Temporal AutoEncoder

We study the problem of syncing the lip movement in a video with the aud...
research
04/06/2022

Multi-Scale Memory-Based Video Deblurring

Video deblurring has achieved remarkable progress thanks to the success ...
research
05/05/2018

Revisiting Temporal Modeling for Video-based Person ReID

Video-based person reID is an important task, which has received much at...
research
09/12/2017

End-to-End United Video Dehazing and Detection

The recent development of CNN-based image dehazing has revealed the effe...
research
03/27/2021

Video Rescaling Networks with Joint Optimization Strategies for Downscaling and Upscaling

This paper addresses the video rescaling task, which arises from the nee...
research
03/01/2019

Frequency Domain Transformer Networks for Video Prediction

The task of video prediction is forecasting the next frames given some p...
research
01/26/2023

Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring

Image-text pretrained models, e.g., CLIP, have shown impressive general ...

Please sign up or login with your details

Forgot password? Click here to reset