Multiview Cross-supervision for Semantic Segmentation

12/04/2018
by   Yuan Yao, et al.
4

This paper presents a semi-supervised learning framework for a customized semantic segmentation task using multiview image streams. A key challenge of the customized task lies in the limited accessibility of the labeled data due to the requirement of prohibitive manual annotation effort. We hypothesize that it is possible to leverage multiview image streams that are linked through the underlying 3D geometry, which can provide an additional supervisionary signal to train a segmentation model. We formulate a new cross-supervision method using a shape belief transfer---the segmentation belief in one image is used to predict that of the other image through epipolar geometry analogous to shape-from-silhouette. The shape belief transfer provides the upper and lower bounds of the segmentation for the unlabeled data where its gap approaches asymptotically to zero as the number of the labeled views increases. We integrate this theory to design a novel network that is agnostic to camera calibration, network model, and semantic category and bypasses the intermediate process of suboptimal 3D reconstruction. We validate this network by recognizing a customized semantic category per pixel from realworld visual data including non-human species and a subject of interest in social videos where attaining large-scale annotation data is infeasible.

READ FULL TEXT

page 12

page 20

page 21

page 22

page 23

page 24

page 25

page 26

research
12/30/2019

Discovering Latent Classes for Semi-Supervised Semantic Segmentation

High annotation costs are a major bottleneck for the training of semanti...
research
11/27/2018

Multiview Supervision By Registration

This paper presents a semi-supervised learning framework to train a keyp...
research
07/21/2020

Shape-aware Semi-supervised 3D Semantic Segmentation for Medical Images

Semi-supervised learning has attracted much attention in medical image s...
research
05/31/2018

MONET: Multiview Semi-supervised Keypoint via Epipolar Divergence

This paper presents MONET---an end-to-end semi-supervised learning frame...
research
04/07/2021

Streaming Self-Training via Domain-Agnostic Unlabeled Images

We present streaming self-training (SST) that aims to democratize the pr...
research
11/21/2022

Instance-specific and Model-adaptive Supervision for Semi-supervised Semantic Segmentation

Recently, semi-supervised semantic segmentation has achieved promising p...
research
12/13/2021

PartGlot: Learning Shape Part Segmentation from Language Reference Games

We introduce PartGlot, a neural framework and associated architectures f...

Please sign up or login with your details

Forgot password? Click here to reset