Self-labelling via simultaneous clustering and representation learning

11/13/2019
by   Yuki Markus Asano, et al.
83

Combining clustering and representation learning is one of the most promising approaches for unsupervised learning of deep neural networks. However, doing so naively leads to ill posed learning problems with degenerate solutions. In this paper, we propose a novel and principled learning formulation that addresses these issues. The method is obtained by maximizing the information between labels and input data indices. We show that this criterion extends standard cross-entropy minimization to an optimal transport problem, which we solve efficiently for millions of input images and thousands of labels using a fast variant of the Sinkhorn-Knopp algorithm. The resulting method is able to self-label visual data so as to train highly competitive image representations without manual labels. Compared to the best previous method in this class, namely DeepCluster, our formulation minimizes a single objective function for both representation learning and clustering; it also significantly outperforms DeepCluster in standard benchmarks and reaches state of the art for learning a ResNet-50 self-supervisedly.

READ FULL TEXT

page 15

page 18

page 19

page 20

page 21

research
02/05/2022

Unsupervised Learning on 3D Point Clouds by Clustering and Contrasting

Learning from unlabeled or partially labeled data to alleviate human lab...
research
03/19/2021

Self-Supervised Classification Network

We present Self-Classifier – a novel self-supervised end-to-end classifi...
research
10/03/2020

Consensus Clustering with Unsupervised Representation Learning

Recent advances in deep clustering and unsupervised representation learn...
research
09/11/2021

Learning Statistical Representation with Joint Deep Embedded Clustering

One of the most promising approaches for unsupervised learning is combin...
research
07/24/2021

Clustering by Maximizing Mutual Information Across Views

We propose a novel framework for image clustering that incorporates join...
research
09/13/2021

Online Unsupervised Learning of Visual Representations and Categories

Real world learning scenarios involve a nonstationary distribution of cl...
research
02/26/2018

Learning Anonymized Representations with Adversarial Neural Networks

Statistical methods protecting sensitive information or the identity of ...

Please sign up or login with your details

Forgot password? Click here to reset