Deep clustering: On the link between discriminative models and K-means

10/09/2018
by   Mohammed Jabi, et al.
0

In the context of recent deep clustering studies, discriminative models dominate the literature and report the most competitive performances. These models learn a deep discriminative neural network classifier in which the labels are latent. Typically, they use multinomial logistic regression posteriors and parameter regularization, as is very common in supervised learning. It is generally acknowledged that discriminative objective functions (e.g., those based on the mutual information or the KL divergence) are more flexible than generative approaches (e.g., K-means) in the sense that they make fewer assumptions about the data distributions and, typically, yield much better unsupervised deep learning results. On the surface, several recent discriminative models may seem unrelated to K-means. This study shows that these models are, in fact, equivalent to K-means under mild conditions and common posterior models and parameter regularization. We prove that, for the commonly used logistic regression posteriors, maximizing the L_2 regularized mutual information via an approximate alternating direction method (ADM) is equivalent to a soft and regularized K-means loss. Our theoretical analysis not only connects directly several recent state-of-the-art discriminative models to K-means, but also leads to a new soft and regularized deep K-means algorithm, which yields competitive performance on several image clustering benchmarks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/06/2023

Generalised Mutual Information: a Framework for Discriminative Clustering

In the last decade, recent successes in deep clustering majorly involved...
research
10/12/2022

Generalised Mutual Information for Discriminative Clustering

In the last decade, recent successes in deep clustering majorly involved...
research
01/26/2023

Revisiting Discriminative Entropy Clustering and its relation to K-means

Maximization of mutual information between the model's input and output ...
research
11/06/2015

Towards a Better Understanding of Predict and Count Models

In a recent paper, Levy and Goldberg pointed out an interesting connecti...
research
02/13/2019

Deep Divergence-Based Approach to Clustering

A promising direction in deep learning research consists in learning rep...
research
03/19/2020

Unsupervised Domain Adaptation via Structurally Regularized Deep Clustering

Unsupervised domain adaptation (UDA) is to make predictions for unlabele...
research
10/02/2020

Regularized K-means through hard-thresholding

We study a framework of regularized K-means methods based on direct pena...

Please sign up or login with your details

Forgot password? Click here to reset