Unsupervised Learning of Debiased Representations with Pseudo-Attributes

08/06/2021
by Seonguk Seo, et al.

Dataset bias is a critical challenge in machine learning, and its negative impact is aggravated when models capture unintended decision rules based on spurious correlations. Although existing works often handle this issue using human supervision, obtaining proper annotations is impractical and even unrealistic. To better tackle this challenge, we propose a simple but effective unsupervised debiasing technique. Specifically, we perform clustering on the feature embedding space and identify pseudo-attributes from the clustering results, without any explicit attribute supervision. Then, we employ a novel cluster-based reweighting scheme for learning debiased representations; this prevents minority groups from being discounted when minimizing the overall loss, which is desirable for worst-case generalization. Extensive experiments demonstrate the outstanding performance of our approach on multiple standard benchmarks, where it is even competitive with its supervised counterpart.
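The two steps named in the abstract, clustering feature embeddings into pseudo-attributes and reweighting the loss by cluster, can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the abstract does not specify the weighting formula, so inverse group-size weighting is assumed here, and the function names `kmeans`, `cluster_weights`, and `reweighted_loss` are hypothetical.

```python
import numpy as np


def kmeans(features, k, n_iter=50, seed=0):
    """Plain Lloyd's k-means; returns one cluster index per sample."""
    features = np.asarray(features, dtype=float)
    rng = np.random.default_rng(seed)
    # Initialise centres from k distinct samples (fancy indexing copies).
    centers = features[rng.choice(len(features), size=k, replace=False)]
    for _ in range(n_iter):
        # Squared Euclidean distance of every sample to every centre: (n, k).
        dists = ((features[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        assign = dists.argmin(axis=1)
        for j in range(k):
            members = features[assign == j]
            if len(members) > 0:  # keep the old centre if a cluster empties
                centers[j] = members.mean(axis=0)
    return assign


def cluster_weights(labels, assign, k):
    """Per-sample weights inversely proportional to pseudo-group size.

    A pseudo-group is a (class label, cluster index) pair. Inverse-size
    weighting is an assumption made for this sketch.
    """
    groups = np.asarray(labels) * k + np.asarray(assign)
    _, inverse, counts = np.unique(
        groups, return_inverse=True, return_counts=True
    )
    weights = 1.0 / counts[inverse]
    return weights / weights.mean()  # normalise so the mean weight is 1


def reweighted_loss(per_sample_loss, weights):
    """Weighted mean of per-sample losses."""
    return float((weights * np.asarray(per_sample_loss)).mean())
```

With weights like these, samples from small pseudo-groups contribute more per example to the training objective, which is one simple way to keep minority groups from being discounted when the overall loss is minimized.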

research
04/13/2019

Improving Distantly-supervised Entity Typing with Compact Latent Space Clustering

Recently, distant supervision has gained great success on Fine-grained E...
research
04/26/2022

Unsupervised Learning of Unbiased Visual Representations

Deep neural networks are known for their inability to learn robust repre...
research
10/03/2020

Consensus Clustering with Unsupervised Representation Learning

Recent advances in deep clustering and unsupervised representation learn...
research
01/10/2022

Information-Theoretic Bias Reduction via Causal View of Spurious Correlation

We propose an information-theoretic bias measurement technique through a...
research
04/05/2022

Spread Spurious Attribute: Improving Worst-group Accuracy with Spurious Attribute Estimation

The paradigm of worst-group loss minimization has shown its promise in a...
research
09/27/2022

Rethinking Clustering-Based Pseudo-Labeling for Unsupervised Meta-Learning

The pioneering method for unsupervised meta-learning, CACTUs, is a clust...
research
06/02/2021

A Cluster-based Approach for Improving Isotropy in Contextual Embedding Space

The representation degeneration problem in Contextual Word Representatio...