Generalised Mutual Information for Discriminative Clustering

10/12/2022
by Louis Ohl, et al.

Over the last decade, successes in deep clustering have largely relied on mutual information (MI) as an unsupervised objective for training neural networks, with increasing regularisation. While the quality of the regularisations has been widely discussed as a route to improvement, little attention has been paid to the relevance of MI as a clustering objective. In this paper, we first highlight how maximising MI does not lead to satisfactory clusters. We identify the Kullback-Leibler divergence as the main reason for this behaviour. Hence, we generalise the mutual information by changing its core distance, introducing the generalised mutual information (GEMINI): a set of metrics for unsupervised neural network training. Unlike MI, some GEMINIs do not require regularisation during training. Some of these metrics are geometry-aware thanks to distances or kernels in the data space. Finally, we highlight that GEMINIs can automatically select a relevant number of clusters, a property that has been little studied in deep clustering contexts where the number of clusters is a priori unknown.
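To make the MI objective concrete, here is a minimal sketch of how it is commonly estimated in discriminative clustering: as the average Kullback-Leibler divergence between each sample's cluster distribution p(y|x) and the batch-averaged marginal p(y). The function name `mutual_information`, the `eps` smoothing constant, and the toy arrays are assumptions made for illustration and are not taken from the paper.

```python
import numpy as np


def mutual_information(probs, eps=1e-12):
    """Estimate I(X; Y) from an (n_samples, n_clusters) array of p(y|x).

    Hypothetical helper: MI is computed as the mean over samples of
    KL(p(y|x) || p(y)), with p(y) estimated as the batch average.
    """
    probs = np.asarray(probs, dtype=float)
    marginal = probs.mean(axis=0)  # empirical cluster proportions p(y)
    # Per-sample KL divergence; eps avoids log(0) for hard assignments.
    kl = np.sum(probs * (np.log(probs + eps) - np.log(marginal + eps)), axis=1)
    return kl.mean()


# Toy inputs (assumed): confident, balanced assignments vs. uniform ones.
confident = np.array([[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]])
uniform = np.full((4, 2), 0.5)

mi_confident = mutual_information(confident)  # close to log(2)
mi_uniform = mutual_information(uniform)      # close to 0
```

Note that this estimate only sees the predicted probabilities, never the positions of the samples: any confident, balanced assignment reaches the same value whether or not it respects the geometry of the data. This is one way to read the abstract's point that maximising MI alone may not give satisfactory clusters.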


research
09/06/2023

Generalised Mutual Information: a Framework for Discriminative Clustering

In the last decade, recent successes in deep clustering majorly involved...
research
09/26/2022

Deep Fair Clustering via Maximizing and Minimizing Mutual Information

Fair clustering aims to divide data into distinct clusters, while preven...
research
10/04/2021

Clustering with Respect to the Information Distance

We discuss the notion of a dense cluster with respect to the information...
research
02/27/2017

Mutual Information based labelling and comparing clusters

After a clustering solution is generated automatically, labelling these ...
research
10/09/2018

Deep clustering: On the link between discriminative models and K-means

In the context of recent deep clustering studies, discriminative models ...
research
10/03/2019

Information based Deep Clustering: An experimental study

Recently, two methods have shown outstanding performance for clustering ...
research
05/01/2017

Forced to Learn: Discovering Disentangled Representations Without Exhaustive Labels

Learning a better representation with neural networks is a challenging p...
