An agglomerative hierarchical clustering method by optimizing the average silhouette width

09/26/2019
by   Fatima Batool, et al.
0

An agglomerative hierarchical clustering (AHC) framework and algorithm named HOSil based on a new linkage metric optimized by the average silhouette width (ASW) index is proposed. A conscientious investigation of various clustering methods and estimation indices is conducted across a diverse verities of data structures for three aims: a) clustering quality, b) clustering recovery, and c) estimation of number of clusters. HOSil has shown better clustering quality for a range of artificial and real world data structures as compared to k-means, PAM, single, complete, average, Ward, McQuitty, spectral, model-based, and several estimation methods. It can identify clusters of various shapes including spherical, elongated, relatively small sized clusters, clusters coming from different distributions including uniform, t, gamma and others. HOSil has shown good recovery for correct determination of the number of clusters. For some data structures only HOSil was able to identify the correct number of clusters.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/18/2019

Clustering by Optimizing the Average Silhouette Width

In this paper, we propose a unified clustering approach that can estimat...
research
10/24/2019

Characterization and Development of Average Silhouette Width Clustering

The purpose of this paper is to introduced a new clustering methodology....
research
09/12/2009

Clustering Based on Pairwise Distances When the Data is of Mixed Dimensions

In the context of clustering, we consider a generative model in a Euclid...
research
06/10/2021

Swarm Intelligence for Self-Organized Clustering

Algorithms implementing populations of agents which interact with one an...
research
09/21/2016

AMOS: An Automated Model Order Selection Algorithm for Spectral Graph Clustering

One of the longstanding problems in spectral graph clustering (SGC) is t...
research
10/19/2017

Frequency Based Index Estimating the Subclusters' Connection Strength

In this paper, a frequency coefficient based on the Sen-Shorrocks-Thon (...
research
06/27/2023

Non-parametric online market regime detection and regime clustering for multidimensional and path-dependent data structures

In this work we present a non-parametric online market regime detection ...

Please sign up or login with your details

Forgot password? Click here to reset