The Hidden Uniform Cluster Prior in Self-Supervised Learning

10/13/2022
by Mahmoud Assran, et al.

A successful paradigm in representation learning is to perform self-supervised pretraining using tasks based on mini-batch statistics (e.g., SimCLR, VICReg, SwAV, MSN). We show that the formulation of all these methods contains an overlooked prior to learn features that enable uniform clustering of the data. While this prior has led to remarkably semantic representations when pretraining on class-balanced data, such as ImageNet, we demonstrate that it can hamper performance when pretraining on class-imbalanced data. By moving away from conventional uniformity priors and instead preferring power-law distributed feature clusters, we show that one can improve the quality of the learned representations on real-world class-imbalanced datasets. To demonstrate this, we develop an extension of the Masked Siamese Networks (MSN) method to support the use of arbitrary feature priors.
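As a rough illustration of the difference between a uniform and a power-law cluster prior, the sketch below penalizes the KL divergence between the mini-batch mean of soft cluster assignments and a chosen target prior. It is only a sketch under stated assumptions, not the authors' implementation: the function names, the `exponent` parameter, and the direction of the KL term are all hypothetical choices made here for illustration. One fact it does rely on is that, for a uniform target over K clusters, this KL term equals log K minus the entropy of the mean assignment, so minimizing it is the same as maximizing that entropy.

```python
import math


def uniform_prior(num_clusters):
    # Equal mass on every cluster: the "hidden" uniform prior.
    return [1.0 / num_clusters] * num_clusters


def power_law_prior(num_clusters, exponent=0.25):
    # Hypothetical target: mass proportional to rank ** (-exponent).
    # The exponent value is an illustrative assumption, not from the source.
    weights = [rank ** (-exponent) for rank in range(1, num_clusters + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def mean_assignment(assignments):
    # assignments: list of rows, each a probability vector over clusters.
    batch_size = len(assignments)
    num_clusters = len(assignments[0])
    return [
        sum(row[k] for row in assignments) / batch_size
        for k in range(num_clusters)
    ]


def entropy(p):
    # Shannon entropy in nats; zero-probability entries contribute nothing.
    return -sum(x * math.log(x) for x in p if x > 0)


def kl_divergence(p, q):
    # KL(p || q) in nats; assumes q is strictly positive wherever p is.
    return sum(x * math.log(x / y) for x, y in zip(p, q) if x > 0)


def prior_penalty(assignments, prior):
    # Regularizer: how far the batch-average cluster usage is from the prior.
    return kl_divergence(mean_assignment(assignments), prior)


if __name__ == "__main__":
    # Toy batch whose cluster usage is skewed toward the first cluster.
    batch = [[0.7, 0.2, 0.1], [0.6, 0.3, 0.1], [0.5, 0.3, 0.2]]
    print("penalty vs uniform prior:  ", prior_penalty(batch, uniform_prior(3)))
    print("penalty vs power-law prior:", prior_penalty(batch, power_law_prior(3, 1.0)))
```

Swapping `uniform_prior` for `power_law_prior` changes only the target distribution, which is the sense in which a regularizer of this shape can accommodate an arbitrary prior over feature clusters.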

