Clustering by Direct Optimization of the Medoid Silhouette

09/26/2022
by Lars Lenssen, et al.

Evaluating clustering results is difficult and depends heavily on the data set and on the perspective of the beholder. Many clustering quality measures have been proposed as general-purpose criteria for validating clustering results. A very popular one is the Silhouette. We discuss the efficient medoid-based variant of the Silhouette, analyze its properties theoretically, and provide two fast versions for its direct optimization. We combine ideas from the original Silhouette with the well-known PAM algorithm and its latest improvement, FasterPAM. One of the versions guarantees results equal to those of the original variant and provides a run-time speedup of O(k^2). In experiments on real data with 30000 samples and k=100, we observed a 10464× speedup compared to the original PAMMEDSIL algorithm.
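The abstract does not spell out the medoid-based Silhouette, so the following is only a minimal sketch under a stated assumption: that each point i is scored as 1 - d1(i)/d2(i), where d1 and d2 are its distances to the nearest and second-nearest medoid, with the ratio taken as 0 when d2 is 0. The function name `medoid_silhouette` and its arguments are hypothetical and are not taken from the paper or from any library. The sketch only evaluates a given set of medoids; it does not implement the optimization algorithms the abstract refers to.

```python
import numpy as np

def medoid_silhouette(dist, medoids):
    """Average medoid Silhouette (assumed definition: mean of 1 - d1/d2).

    dist    : (n, n) array of pairwise distances
    medoids : sequence of k >= 2 row indices chosen as medoids
    """
    d = np.asarray(dist, dtype=float)[:, list(medoids)]  # (n, k) distances to medoids
    if d.shape[1] < 2:
        raise ValueError("need at least two medoids")
    # After partitioning at position 1, column 0 holds the smallest distance
    # and column 1 the second smallest, for every row.
    part = np.partition(d, 1, axis=1)
    d1, d2 = part[:, 0], part[:, 1]
    # Assumed convention: the ratio is 0 wherever d2 == 0.
    ratio = np.divide(d1, d2, out=np.zeros_like(d1), where=d2 > 0)
    return float(np.mean(1.0 - ratio))

# Toy example: four points on a line, with the points at 0 and 10 as medoids.
x = np.array([0.0, 1.0, 10.0, 11.0])
toy_dist = np.abs(x[:, None] - x[None, :])
toy_score = medoid_silhouette(toy_dist, [0, 2])
```

Only the n × k block of distances to the medoids is needed, which is what makes this variant cheaper to evaluate than the original Silhouette, whose computation uses all pairwise distances.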

research
09/07/2023

Medoid Silhouette clustering with automatic cluster number selection

The evaluation of clustering results is difficult, highly dependent on t...
research
10/12/2018

Faster k-Medoids Clustering: Improving the PAM, CLARA, and CLARANS Algorithms

Clustering non-Euclidean data is difficult, and one of the most used alg...
research
03/09/2018

A local depth measure for general data

We herein introduce a general local depth measure for data in a Banach s...
research
05/29/2018

A Novel Multi-clustering Method for Hierarchical Clusterings, Based on Boosting

Bagging and boosting are proved to be the best methods of building multi...
research
07/22/2020

Frank-Wolfe Optimization for Dominant Set Clustering

We study Frank-Wolfe algorithms – standard, pairwise, and away-steps – f...
research
02/25/2015

Exploiting a comparability mapping to improve bi-lingual data categorization: a three-mode data analysis perspective

We address in this paper the co-clustering and co-classification of bili...
research
09/18/2019

Most General Variant Unifiers

Equational unification of two terms consists of finding a substitution t...