DivClust: Controlling Diversity in Deep Clustering

04/03/2023
by Ioannis Maniadis Metaxas, et al.

Clustering has been a major research topic in machine learning, one to which deep learning has recently been applied with significant success. However, existing deep clustering methods do not address the problem of efficiently producing multiple, diverse partitionings of a given dataset. This is particularly important because a diverse set of base clusterings is necessary for consensus clustering, which has been found to produce better and more robust results than relying on a single clustering. To address this gap, we propose DivClust, a diversity-controlling loss that can be incorporated into existing deep clustering frameworks to produce multiple clusterings with the desired degree of diversity. We conduct experiments with multiple datasets and deep clustering frameworks and show that: a) our method effectively controls diversity across frameworks and datasets with very small additional computational cost, b) the sets of clusterings learned by DivClust include solutions that significantly outperform single-clustering baselines, and c) using an off-the-shelf consensus clustering algorithm, DivClust produces consensus clustering solutions that consistently outperform single-clustering baselines, effectively improving the performance of the base deep clustering framework.
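To make the notion of "diversity" between clusterings concrete, the sketch below scores a set of partitionings by their mean pairwise normalized mutual information (NMI): a lower mean NMI means a more diverse set. This is only an illustration under stated assumptions, not the DivClust loss itself. The choice of NMI with geometric-mean normalization and the helper names `nmi` and `mean_pairwise_nmi` are assumptions introduced here.

```python
import math
from collections import Counter
from itertools import combinations


def nmi(labels_a, labels_b):
    """NMI between two partitionings of the same samples (assumed similarity measure)."""
    n = len(labels_a)
    if n == 0 or n != len(labels_b):
        raise ValueError("label lists must be non-empty and of equal length")

    count_a = Counter(labels_a)
    count_b = Counter(labels_b)
    count_ab = Counter(zip(labels_a, labels_b))

    def entropy(counts):
        return -sum((c / n) * math.log(c / n) for c in counts.values())

    h_a, h_b = entropy(count_a), entropy(count_b)

    # Mutual information computed from the joint cluster-assignment counts.
    mi = sum(
        (c / n) * math.log(c * n / (count_a[a] * count_b[b]))
        for (a, b), c in count_ab.items()
    )

    denom = math.sqrt(h_a * h_b)
    if denom == 0.0:
        # A single-cluster partitioning has zero entropy; treat two such
        # partitionings as identical and any other pairing as unrelated.
        return 1.0 if h_a == h_b else 0.0
    return mi / denom


def mean_pairwise_nmi(clusterings):
    """Average NMI over all pairs; lower values indicate a more diverse set."""
    pairs = list(combinations(clusterings, 2))
    if not pairs:
        raise ValueError("need at least two clusterings")
    return sum(nmi(a, b) for a, b in pairs) / len(pairs)


if __name__ == "__main__":
    # Hypothetical toy partitionings of four samples.
    base = [0, 0, 1, 1]
    relabelled = [1, 1, 0, 0]   # same partitioning, different label names
    crossed = [0, 1, 0, 1]      # statistically independent of `base`
    print(mean_pairwise_nmi([base, relabelled, crossed]))
```

A score like this could serve as a quick check on whether a set of base clusterings is varied enough to be worth passing to a consensus clustering algorithm.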

research
10/13/2022

Deep Clustering With Consensus Representations

The field of deep clustering combines deep learning and clustering to le...
research
02/07/2021

Determinantal consensus clustering

Random restart of a given algorithm produces many partitions to yield a ...
research
04/26/2016

Condorcet's Jury Theorem for Consensus Clustering and its Implications for Diversity

Condorcet's Jury Theorem has been invoked for ensemble classifiers to in...
research
02/08/2021

Large-data determinantal clustering

Determinantal consensus clustering is a promising and attractive alterna...
research
04/30/2014

A Bi-clustering Framework for Consensus Problems

We consider grouping as a general characterization for problems such as ...
research
12/27/2022

Robust Consensus Clustering and its Applications for Advertising Forecasting

Consensus clustering aggregates partitions in order to find a better fit...
research
11/09/2022

An Empirical Study on Clustering Pretrained Embeddings: Is Deep Strictly Better?

Recent research in clustering face embeddings has found that unsupervise...
