Virtual Multi-Modality Self-Supervised Foreground Matting for Human-Object Interaction

10/07/2021
by   Bo Xu, et al.
8

Most existing human matting algorithms tried to separate pure human-only foreground from the background. In this paper, we propose a Virtual Multi-modality Foreground Matting (VMFM) method to learn human-object interactive foreground (human and objects interacted with him or her) from a raw RGB image. The VMFM method requires no additional inputs, e.g. trimap or known background. We reformulate foreground matting as a self-supervised multi-modality problem: factor each input image into estimated depth map, segmentation mask, and interaction heatmap using three auto-encoders. In order to fully utilize the characteristics of each modality, we first train a dual encoder-to-decoder network to estimate the same alpha matte. Then we introduce a self-supervised method: Complementary Learning(CL) to predict deviation probability map and exchange reliable gradients across modalities without label. We conducted extensive experiments to analyze the effectiveness of each modality and the significance of different components in complementary learning. We demonstrate that our model outperforms the state-of-the-art methods.

READ FULL TEXT

page 1

page 3

page 6

page 8

page 12

page 13

page 14

page 15

research
04/01/2019

Self-Supervised Robot In-hand Object Learning

In order to complete tasks in a new environment, robots must be able to ...
research
08/15/2021

SSH: A Self-Supervised Framework for Image Harmonization

Image harmonization aims to improve the quality of image compositing by ...
research
08/30/2021

Digging into Uncertainty in Self-supervised Multi-view Stereo

Self-supervised Multi-view stereo (MVS) with a pretext task of image rec...
research
10/06/2022

CIR-Net: Cross-modality Interaction and Refinement for RGB-D Salient Object Detection

Focusing on the issue of how to effectively capture and utilize cross-mo...
research
08/24/2018

Automatic Foreground Extraction using Multi-Agent Consensus Equilibrium

While foreground extraction is fundamental to virtual reality systems an...
research
06/02/2023

Evaluating The Robustness of Self-Supervised Representations to Background/Foreground Removal

Despite impressive empirical advances of SSL in solving various tasks, t...
research
06/02/2022

Optimizing Relevance Maps of Vision Transformers Improves Robustness

It has been observed that visual classification models often rely mostly...

Please sign up or login with your details

Forgot password? Click here to reset