Unbiased Supervised Contrastive Learning

11/10/2022
by   Carlo Alberto Barbano, et al.
0

Many datasets are biased, namely they contain easy-to-learn features that are highly correlated with the target class only in the dataset but not in the true underlying distribution of the data. For this reason, learning unbiased models from biased data has become a very relevant research topic in the last years. In this work, we tackle the problem of learning representations that are robust to biases. We first present a margin-based theoretical framework that allows us to clarify why recent contrastive losses (InfoNCE, SupCon, etc.) can fail when dealing with biased data. Based on that, we derive a novel formulation of the supervised contrastive loss (epsilon-SupInfoNCE), providing more accurate control of the minimal distance between positive and negative samples. Furthermore, thanks to our theoretical framework, we also propose FairKL, a new debiasing regularization loss, that works well even with extremely biased data. We validate the proposed losses on standard vision datasets including CIFAR10, CIFAR100, and ImageNet, and we assess the debiasing capability of FairKL with epsilon-SupInfoNCE, reaching state-of-the-art performance on a number of biased datasets, including real instances of biases in the wild.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/11/2022

Feature-Level Debiased Natural Language Understanding

Natural language understanding (NLU) models often rely on dataset biases...
research
10/11/2022

Efficient debiasing with contrastive weight pruning

Neural networks are often biased to spuriously correlated features that ...
research
06/03/2022

Rethinking Positive Sampling for Contrastive Learning with Kernel

Data augmentation is a crucial component in unsupervised contrastive lea...
research
10/10/2022

Towards Robust Visual Question Answering: Making the Most of Biased Samples via Contrastive Learning

Models for Visual Question Answering (VQA) often rely on the spurious co...
research
04/07/2023

Supervised Contrastive Learning with Heterogeneous Similarity for Distribution Shifts

Distribution shifts are problems where the distribution of data changes ...
research
04/27/2020

Optimal Decisions of a Rational Agent in the Presence of Biased Information Providers

We consider information networks whereby multiple biased-information-pro...
research
07/29/2023

Debiased Pairwise Learning from Positive-Unlabeled Implicit Feedback

Learning contrastive representations from pairwise comparisons has achie...

Please sign up or login with your details

Forgot password? Click here to reset