Temperature Schedules for Self-Supervised Contrastive Methods on Long-Tail Data

03/23/2023
by Anna Kukleva, et al.

Most approaches for self-supervised learning (SSL) are optimised on curated balanced datasets, e.g. ImageNet, despite the fact that natural data usually exhibits long-tail distributions. In this paper, we analyse the behaviour of one of the most popular variants of SSL, i.e. contrastive methods, on long-tail data. In particular, we investigate the role of the temperature parameter τ in the contrastive loss by analysing the loss through the lens of average distance maximisation, and find that a large τ emphasises group-wise discrimination, whereas a small τ leads to a higher degree of instance discrimination. While τ has thus far been treated exclusively as a constant hyperparameter, in this work we propose to employ a dynamic τ and show that a simple cosine schedule can yield significant improvements in the learnt representations. Such a schedule results in a constant 'task switching' between an emphasis on instance discrimination and an emphasis on group-wise discrimination, and thereby ensures that the model learns both group-wise features and instance-specific details. Since frequent classes benefit from the former, while infrequent classes require the latter, we find this method to consistently improve separation between the classes in long-tail data without any additional computational cost.
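To make the idea of a cosine temperature schedule concrete, the following is a minimal sketch, not the authors' implementation. It assumes a periodic cosine that oscillates between two bounds. The names `cosine_temperature`, `negative_weights`, `tau_min`, `tau_max` and `period`, as well as the default values, are hypothetical and chosen only for illustration; the abstract does not specify them. The second function shows the standard softmax weighting of negatives by similarity, which illustrates why a small τ concentrates the loss on the most similar samples.

```python
import math


def cosine_temperature(step, period, tau_min=0.1, tau_max=1.0):
    """Temperature at a given training step under an assumed periodic cosine.

    Starts at tau_max (step 0), reaches tau_min at half a period, and
    returns to tau_max after a full period. All bounds are illustrative.
    """
    phase = 2.0 * math.pi * step / period
    return tau_min + 0.5 * (tau_max - tau_min) * (1.0 + math.cos(phase))


def negative_weights(similarities, tau):
    """Softmax over similarity / tau: relative weight each negative receives."""
    scaled = [s / tau for s in similarities]
    shift = max(scaled)  # subtract the maximum for numerical stability
    exps = [math.exp(s - shift) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


if __name__ == "__main__":
    sims = [0.9, 0.5, 0.1, -0.3]  # made-up cosine similarities to negatives
    for step in (0, 25, 50):
        tau = cosine_temperature(step, period=100)
        weights = negative_weights(sims, tau)
        print(step, round(tau, 3), [round(w, 3) for w in weights])
```

With a small τ nearly all of the weight falls on the most similar negative, whereas a large τ spreads it more evenly across the negatives; cycling τ during training alternates between these two regimes.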

research
11/06/2022

Understanding the properties and limitations of contrastive learning for Out-of-Distribution detection

A recent popular approach to out-of-distribution (OOD) detection is base...
research
08/09/2020

Unsupervised Feature Learning by Cross-Level Discrimination between Instances and Groups

Unsupervised feature learning has made great strides with invariant mapp...
research
01/26/2021

Revisiting Contrastive Learning for Few-Shot Classification

Instance discrimination based contrastive learning has emerged as a lead...
research
05/19/2023

Not All Semantics are Created Equal: Contrastive Self-supervised Learning with Automatic Temperature Individualization

In this paper, we aim to optimize a contrastive loss with individualized...
research
04/05/2023

ACTION++: Improving Semi-supervised Medical Image Segmentation with Adaptive Anatomical Contrast

Medical data often exhibits long-tail distributions with heavy class imb...
research
03/16/2023

All4One: Symbiotic Neighbour Contrastive Learning via Self-Attention and Redundancy Reduction

Nearest neighbour based methods have proved to be one of the most succes...
research
02/03/2022

The Met Dataset: Instance-level Recognition for Artworks

This work introduces a dataset for large-scale instance-level recognitio...
