Unsupervised Semantic Segmentation by Contrasting Object Mask Proposals

02/11/2021
by Wouter Van Gansbeke, et al.

Being able to learn dense semantic representations of images without supervision is an important problem in computer vision. However, despite its significance, this problem remains rather unexplored, with a few exceptions that considered unsupervised semantic segmentation on small-scale datasets with a narrow visual domain. In this paper, we make a first attempt to tackle the problem on datasets that have been traditionally utilized for the supervised case. To achieve this, we introduce a novel two-step framework that adopts a predetermined prior in a contrastive optimization objective to learn pixel embeddings. This marks a large deviation from existing works that relied on proxy tasks or end-to-end clustering. Additionally, we argue about the importance of having a prior that contains information about objects, or their parts, and discuss several possibilities to obtain such a prior in an unsupervised manner. Extensive experimental evaluation shows that the proposed method comes with key advantages over existing works. First, the learned pixel embeddings can be directly clustered in semantic groups using K-Means. Second, the method can serve as an effective unsupervised pre-training for the semantic segmentation task. In particular, when fine-tuning the learned representations using just 1% of labeled examples on PASCAL, we outperform supervised ImageNet pre-training by 7.1% mIoU. The code is available at https://github.com/wvangansbeke/Unsupervised-Semantic-Segmentation.
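
The abstract describes learning pixel embeddings with a contrastive objective guided by object mask proposals, but it does not spell out the loss. The following is a minimal pure-Python sketch of one way such an objective could look, assuming an InfoNCE-style loss in which each pixel is pulled toward the mean embedding of its own mask and pushed away from the mean embeddings of other masks. The function names, the `temperature` value, and the choice to draw negatives from masks in the same batch are illustrative assumptions, not the authors' implementation; see the linked repository for the actual code.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit length (left unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in v)) or 1.0
    return [x / norm for x in v]


def mask_prototypes(embeddings, mask_ids):
    """Return the L2-normalized mean embedding of the pixels in each mask."""
    sums, counts = {}, {}
    for emb, m in zip(embeddings, mask_ids):
        acc = sums.setdefault(m, [0.0] * len(emb))
        for i, x in enumerate(emb):
            acc[i] += x
        counts[m] = counts.get(m, 0) + 1
    return {m: l2_normalize([x / counts[m] for x in acc])
            for m, acc in sums.items()}


def pixel_contrastive_loss(embeddings, mask_ids, temperature=0.5):
    """Hypothetical InfoNCE-style loss over pixels and mask prototypes.

    embeddings: list of per-pixel feature vectors.
    mask_ids:   mask proposal id for each pixel (the unsupervised prior).
    """
    protos = mask_prototypes(embeddings, mask_ids)
    keys = list(protos)
    total = 0.0
    for emb, m in zip(embeddings, mask_ids):
        z = l2_normalize(emb)
        # Cosine similarity of the pixel to every mask prototype.
        logits = [sum(a * b for a, b in zip(z, protos[k])) / temperature
                  for k in keys]
        # Numerically stable log-sum-exp for the softmax denominator.
        mx = max(logits)
        log_denom = mx + math.log(sum(math.exp(l - mx) for l in logits))
        # Negative log-probability of the pixel's own mask.
        total += log_denom - logits[keys.index(m)]
    return total / len(embeddings)


# Toy example: four 2-D pixel embeddings forming two visible groups.
pixels = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
aligned = pixel_contrastive_loss(pixels, [0, 0, 1, 1])     # masks match groups
misaligned = pixel_contrastive_loss(pixels, [0, 1, 0, 1])  # masks cut across groups
print(aligned < misaligned)
```

In this toy setting the loss is lower when the mask proposals agree with the structure of the embeddings, which is the signal that would push a network toward grouping pixels of the same object. Embeddings trained this way could then be clustered with K-Means, as the abstract notes.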


