Trimap-guided Feature Mining and Fusion Network for Natural Image Matting

by Weihao Jiang, et al.
Shanghai Jiao Tong University
ByteDance Inc.

Utilizing trimap guidance and fusing multi-level features are two important issues for trimap-based matting with pixel-level prediction. To utilize trimap guidance, most existing approaches either simply concatenate the trimap and the image as input to a deep network or apply an extra network to extract more trimap guidance, which creates a conflict between efficiency and effectiveness. For emerging content-based feature fusion, most existing matting methods focus only on local features, which lack the guidance of a global feature carrying strong semantic information about the object of interest. In this paper, we propose a trimap-guided feature mining and fusion network consisting of our trimap-guided non-background multi-scale pooling (TMP) module and global-local context-aware fusion (GLF) modules. Because the trimap provides strong semantic guidance, our TMP module focuses feature mining on objects of interest under the guidance of the trimap, without extra parameters. Furthermore, our GLF modules use the global semantic information about objects of interest mined by our TMP module to guide effective global-local context-aware multi-level feature fusion. In addition, we build a common interesting object matting (CIOM) dataset to advance high-quality image matting. Experimental results on the Composition-1k test set, the Alphamatting benchmark, and our CIOM test set demonstrate that our method outperforms state-of-the-art approaches. Code and models will be publicly available soon.
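
The abstract does not give the internals of the TMP module, so the following is only a rough sketch of one way a parameter-free, trimap-guided, non-background, multi-scale pooling could look; it is not the authors' implementation. The function name `non_background_pool`, the grid sizes, and the trimap encoding (0 for background, any other value for unknown or foreground) are all assumptions made for illustration.

```python
import numpy as np


def non_background_pool(features, trimap, grid_sizes=(1, 2, 4)):
    """Average features over non-background pixels in each grid cell.

    features: float array of shape (C, H, W).
    trimap:   int array of shape (H, W). Assumed encoding: 0 is background,
              every other value (unknown or foreground) is kept.
    Returns a dict mapping each grid size g to an array of shape (C, g, g).
    """
    channels, height, width = features.shape
    mask = (trimap > 0).astype(features.dtype)  # 1 where not background
    pooled = {}
    for g in grid_sizes:
        out = np.zeros((channels, g, g), dtype=features.dtype)
        # Cell boundaries; linspace also handles sizes not divisible by g.
        rows = np.linspace(0, height, g + 1).astype(int)
        cols = np.linspace(0, width, g + 1).astype(int)
        for i in range(g):
            for j in range(g):
                r0, r1 = rows[i], rows[i + 1]
                c0, c1 = cols[j], cols[j + 1]
                cell_mask = mask[r0:r1, c0:c1]
                count = cell_mask.sum()
                if count > 0:
                    cell_feat = features[:, r0:r1, c0:c1]
                    # Masked mean: background pixels contribute nothing.
                    out[:, i, j] = (cell_feat * cell_mask).sum(axis=(1, 2)) / count
                # Cells that are entirely background stay at zero.
        pooled[g] = out
    return pooled
```

The pooling has no learnable weights, which matches the "without extra parameters" claim in spirit: the trimap only decides which pixels are averaged. How the pooled descriptors would then be fed to the GLF modules is not described in the abstract and is left out here.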



