Clustering by Optimizing the Average Silhouette Width

10/18/2019
by   Fatima Batool, et al.
14

In this paper, we propose a unified clustering approach that can estimate number of clusters and produce clustering against this number simultaneously. Average silhouette width (ASW) is a widely used standard cluster quality index. We define a distance based objective function that optimizes ASW for clustering. The proposed algorithm named as OSil, only, needs data observations as an input without any prior knowledge of the number of clusters. This work is about thorough investigation of the proposed methodology, its usefulness and limitations. A vast spectrum of clustering structures were generated, and several well-known clustering methods including partitioning, hierarchical, density based, and spatial methods were consider as the competitor of the proposed methodology. Simulation reveals that OSil algorithm has shown superior perform in terms of clustering quality than all clustering methods included in the study. OSil can find well separated, compact clusters and have shown better performance for the estimation of number of clusters than several methods. Apart from the proposal of the new methodology and it's investigation this papers offer a systematic analysis on the estimation of cluster indices, some of which never appeared together in comparative simulation setup before. The study offers many insightful findings useful for the selection of the clustering methods and indices.

READ FULL TEXT

page 33

page 34

page 37

page 40

page 41

research
09/26/2019

An agglomerative hierarchical clustering method by optimizing the average silhouette width

An agglomerative hierarchical clustering (AHC) framework and algorithm n...
research
10/24/2019

Characterization and Development of Average Silhouette Width Clustering

The purpose of this paper is to introduced a new clustering methodology....
research
10/17/2018

Structural Equation Modeling and simultaneous clustering through the Partial Least Squares algorithm

The identification of different homogeneous groups of observations and t...
research
08/03/2016

Improving Quality of Hierarchical Clustering for Large Data Series

Brown clustering is a hard, hierarchical, bottom-up clustering of words ...
research
01/06/2022

A new measure for assessment of clustering based on kernel density estimation

A new clustering accuracy measure is proposed to determine the unknown n...
research
05/27/2023

Overlapping Indices for Dynamic Information Borrowing in Bayesian Hierarchical Modeling

Bayesian hierarchical model (BHM) has been widely used in synthesizing i...
research
12/18/2019

s-DRN: Stabilized Developmental Resonance Network

Online incremental clustering of sequentially incoming data without prio...

Please sign up or login with your details

Forgot password? Click here to reset