Identifying the number of clusters for K-Means: A hypersphere density based approach

by   Sukavanan Nanjundan, et al.

Application of K-Means algorithm is restricted by the fact that the number of clusters should be known beforehand. Previously suggested methods to solve this problem are either ad hoc or require parametric assumptions and complicated calculations. The proposed method aims to solve this conundrum by considering cluster hypersphere density as the factor to determine the number of clusters in the given dataset. The density is calculated by assuming a hypersphere around the cluster centroid for n-different number of clusters. The calculated values are plotted against their corresponding number of clusters and then the optimum number of clusters is obtained after assaying the elbow region of the graph. The method is simple and easy to comprehend and provides robust and reliable results.



page 1

page 2

page 3

page 4


A Proposed Method for Assessing Cluster Heterogeneity

Assessing how adequate clusters fit a dataset and finding an optimum num...

CNAK : Cluster Number Assisted K-means

Determining the number of clusters present in a dataset is an important ...

A description length approach to determining the number of k-means clusters

We present an asymptotic criterion to determine the optimal number of cl...

SMLSOM: The shrinking maximum likelihood self-organizing map

Determining the number of clusters in a dataset is a fundamental issue i...

Penalized k-means algorithms for finding the correct number of clusters in a dataset

In many applications we want to find the number of clusters in a dataset...

Clustering Optimization: Finding the Number and Centroids of Clusters by a Fourier-based Algorithm

We propose a Fourier-based approach for optimization of several clusteri...

Identifying bias in cluster quality metrics

We study potential biases of popular cluster quality metrics, such as co...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.