A new nonparametric interpoint distance-based measure for assessment of clustering

10/01/2022
by   Soumita Modak, et al.
0

A new interpoint distance-based measure is proposed to identify the optimal number of clusters present in a data set. Designed in nonparametric approach, it is independent of the distribution of given data. Interpoint distances between the data members make our cluster validity index applicable to univariate and multivariate data measured on arbitrary scales, or having observations in any dimensional space where the number of study variables can be even larger than the sample size. Our proposed criterion is compatible with any clustering algorithm, and can be used to determine the unknown number of clusters or to assess the quality of the resulting clusters for a data set. Demonstration through synthetic and real-life data establishes its superiority over the well-known clustering accuracy measures of the literature.

READ FULL TEXT
research
01/06/2022

A new measure for assessment of clustering based on kernel density estimation

A new clustering accuracy measure is proposed to determine the unknown n...
research
09/03/2021

J-Score: A Robust Measure of Clustering Accuracy

Background. Clustering analysis discovers hidden structures in a data se...
research
02/07/2019

Online Clustering by Penalized Weighted GMM

With the dawn of the Big Data era, data sets are growing rapidly. Data i...
research
10/09/2009

Scaling Analysis of Affinity Propagation

We analyze and exploit some scaling properties of the Affinity Propagati...
research
09/13/2022

Genie: A new, fast, and outlier-resistant hierarchical clustering algorithm

The time needed to apply a hierarchical clustering algorithm is most oft...
research
01/26/2021

Applications of Clustering with Mixed Type Data in Life Insurance

Death benefits are generally the largest cash flow item that affects fin...
research
07/04/2019

k is the Magic Number -- Inferring the Number of Clusters Through Nonparametric Concentration Inequalities

Most convex and nonconvex clustering algorithms come with one crucial pa...

Please sign up or login with your details

Forgot password? Click here to reset