Is a Green Screen Really Necessary for Real-Time Human Matting?

11/24/2020
by Zhanghan Ke, et al.

For human matting without a green screen, existing works either require auxiliary inputs that are costly to obtain or use multiple models that are computationally expensive. Consequently, they are impractical for real-time applications. In contrast, we present a lightweight matting objective decomposition network (MODNet), which performs human matting from a single input image in real time. The design of MODNet benefits from optimizing a series of correlated sub-objectives simultaneously via explicit constraints. Moreover, since trimap-free methods usually suffer from the domain shift problem in practice, we introduce (1) a self-supervised strategy based on sub-objective consistency to adapt MODNet to real-world data and (2) a one-frame delay trick to smooth the results when applying MODNet to video human matting. MODNet is easy to train end to end. It is much faster than contemporaneous matting methods and runs at 63 frames per second. On a carefully designed human matting benchmark newly proposed in this work, MODNet greatly outperforms prior trimap-free methods. More importantly, our method achieves remarkable results on daily photos and videos. Now, do you really need a green screen for real-time human matting?
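
The abstract names a one-frame delay trick for smoothing video results but does not spell it out. The following is a minimal sketch of one way such a trick could work, not the authors' implementation: a pixel of the current alpha matte is treated as a flicker when its previous and next frames agree with each other but both differ from it, and it is then replaced by their average. The function name and the two tolerance parameters are assumptions made for illustration.

```python
import numpy as np


def one_frame_delay_smooth(prev_alpha, cur_alpha, next_alpha,
                           close_tol=0.1, flicker_tol=0.1):
    """Return a smoothed copy of cur_alpha (hypothetical helper).

    All three mattes are arrays of the same shape with values in [0, 1].
    close_tol and flicker_tol are illustrative thresholds, not values
    taken from the paper.
    """
    prev_alpha = np.asarray(prev_alpha, dtype=np.float64)
    cur_alpha = np.asarray(cur_alpha, dtype=np.float64)
    next_alpha = np.asarray(next_alpha, dtype=np.float64)

    # The two neighbouring frames agree with each other at this pixel.
    neighbours_agree = np.abs(prev_alpha - next_alpha) <= close_tol
    # The current frame disagrees with both neighbours.
    differs_prev = np.abs(cur_alpha - prev_alpha) > flicker_tol
    differs_next = np.abs(cur_alpha - next_alpha) > flicker_tol
    flicker = neighbours_agree & differs_prev & differs_next

    # Replace only the flickering pixels with the neighbour average.
    smoothed = cur_alpha.copy()
    smoothed[flicker] = 0.5 * (prev_alpha[flicker] + next_alpha[flicker])
    return smoothed
```

Because the next frame is needed before the current one can be corrected, the output lags the input by one frame, which is where the name comes from.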


