Are Cluster Validity Measures (In)valid?

08/02/2022
by   Marek Gagolewski, et al.
0

Internal cluster validity measures (such as the Calinski-Harabasz, Dunn, or Davies-Bouldin indices) are frequently used for selecting the appropriate number of partitions a dataset should be split into. In this paper we consider what happens if we treat such indices as objective functions in unsupervised learning activities. Is the optimal grouping with regards to, say, the Silhouette index really meaningful? It turns out that many cluster (in)validity indices promote clusterings that match expert knowledge quite poorly. We also introduce a new, well-performing variant of the Dunn index that is built upon OWA operators and the near-neighbour graph so that subspaces of higher density, regardless of their shapes, can be separated from each other better.

READ FULL TEXT

page 3

page 21

page 26

research
09/23/2021

Clustering performance analysis using new correlation based cluster validity indices

There are various cluster validity measures used for evaluating clusteri...
research
05/11/2021

An internal validity index based on density-involved distance

It is crucial to evaluate the quality of clustering results in cluster a...
research
01/08/2018

Online Cluster Validity Indices for Streaming Data

Cluster analysis is used to explore structure in unlabeled data sets in ...
research
01/07/2019

Understanding partition comparison indices based on counting object pairs

In unsupervised machine learning, agreement between partitions is common...
research
08/02/2023

A new approach for evaluating internal cluster validation indices

A vast number of different methods are available for unsupervised classi...
research
02/18/2019

Incremental Cluster Validity Indices for Hard Partitions: Extensions and Comparative Study

Validation is one of the most important aspects of clustering, but most ...
research
06/17/2016

Ground Truth Bias in External Cluster Validity Indices

It has been noticed that some external CVIs exhibit a preferential bias ...

Please sign up or login with your details

Forgot password? Click here to reset