Consensus Clustering with Unsupervised Representation Learning

10/03/2020
by   Jayanth Reddy Regatti, et al.
0

Recent advances in deep clustering and unsupervised representation learning are based on the idea that different views of an input image (generated through data augmentation techniques) must either be closer in the representation space, or have a similar cluster assignment. In this work, we leverage this idea together with ensemble learning to perform clustering and representation learning. Ensemble learning is widely used in the supervised learning setting but has not yet been practical in deep clustering. Previous works on ensemble learning for clustering neither work on the feature space nor learn features. We propose a novel ensemble learning algorithm dubbed Consensus Clustering with Unsupervised Representation Learning (ConCURL) which learns representations by creating a consensus on multiple clustering outputs. Specifically, we generate a cluster ensemble using random transformations on the embedding space, and define a consensus loss function that measures the disagreement among the constituents of the ensemble. Thus, diverse ensembles minimize this loss function in a synergistic way, which leads to better representations that work with all cluster ensemble constituents. Our proposed method ConCURL is easy to implement and integrate into any representation learning or deep clustering block. ConCURL outperforms all state of the art methods on various computer vision datasets. Specifically, we beat the closest state of the art method by 5.9 percent on the ImageNet-10 dataset, and by 18 percent on the ImageNet-Dogs dataset in terms of clustering accuracy. We further shed some light on the under-studied overfitting issue in clustering and show that our method does not overfit as much as existing methods, and thereby generalizes better for new data samples.

READ FULL TEXT
research
05/04/2021

Representation Learning for Clustering via Building Consensus

In this paper, we focus on deep clustering and unsupervised representati...
research
10/13/2022

Deep Clustering With Consensus Representations

The field of deep clustering combines deep learning and clustering to le...
research
01/19/2019

Deep Representation Learning Characterized by Inter-class Separation for Image Clustering

Despite significant advances in clustering methods in recent years, the ...
research
06/11/2021

Learning the Precise Feature for Cluster Assignment

Clustering is one of the fundamental tasks in computer vision and patter...
research
06/22/2023

AugDMC: Data Augmentation Guided Deep Multiple Clustering

Clustering aims to group similar objects together while separating dissi...
research
11/13/2019

Self-labelling via simultaneous clustering and representation learning

Combining clustering and representation learning is one of the most prom...
research
08/06/2021

Unsupervised Learning of Debiased Representations with Pseudo-Attributes

Dataset bias is a critical challenge in machine learning, and its negati...

Please sign up or login with your details

Forgot password? Click here to reset