Boosting Semantic Human Matting with Coarse Annotations

by   Jinlin Liu, et al.

Semantic human matting aims to estimate the per-pixel opacity of the foreground human regions. It is quite challenging and usually requires user interactive trimaps and plenty of high quality annotated data. Annotating such kind of data is labor intensive and requires great skills beyond normal users, especially considering the very detailed hair part of humans. In contrast, coarse annotated human dataset is much easier to acquire and collect from the public dataset. In this paper, we propose to use coarse annotated data coupled with fine annotated data to boost end-to-end semantic human matting without trimaps as extra input. Specifically, we train a mask prediction network to estimate the coarse semantic mask using the hybrid data, and then propose a quality unification network to unify the quality of the previous coarse mask outputs. A matting refinement network takes in the unified mask and the input image to predict the final alpha matte. The collected coarse annotated dataset enriches our dataset significantly, allows generating high quality alpha matte for real images. Experimental results show that the proposed method performs comparably against state-of-the-art methods. Moreover, the proposed method can be used for refining coarse annotated public dataset, as well as semantic segmentation methods, which reduces the cost of annotating high quality human data to a great extent.


page 1

page 3

page 4

page 5

page 6

page 7

page 8


Coarse-to-Fine Annotation Enrichment for Semantic Segmentation Learning

Rich high-quality annotated data is critical for semantic segmentation l...

Semantic Human Matting

Human matting, high quality extraction of humans from natural images, is...

Semantic Segmentation with Scarce Data

Semantic segmentation is a challenging vision problem that usually neces...

Predicting How to Distribute Work Between Algorithms and Humans to Segment an Image Batch

Foreground object segmentation is a critical step for many image analysi...

LMQFormer: A Laplace-Prior-Guided Mask Query Transformer for Lightweight Snow Removal

Snow removal aims to locate snow areas and recover clean images without ...

AlphaNet: An Attention Guided Deep Network for Automatic Image Matting

In this paper, we propose an end to end solution for image matting i.e h...

SEMPART: Self-supervised Multi-resolution Partitioning of Image Semantics

Accurately determining salient regions of an image is challenging when l...

Please sign up or login with your details

Forgot password? Click here to reset