A new interpoint distance-based clustering algorithm using kernel density estimation

04/28/2023
by Dr. Soumita Modak, et al.

We propose a novel nonparametric clustering algorithm that uses the interpoint distances between the members of a data set to reveal its inherent clustering structure. The classical nonparametric univariate kernel density estimation method is applied to the interpoint distances to estimate the density around each data member. The algorithm is simple in construction and easy to apply, and it yields well-defined clusters. It starts with an objective selection of the initial cluster representative and always converges, regardless of this choice. The method determines the number of clusters itself and, given an appropriate interpoint distance measure, can be used irrespective of the nature of the underlying data. The cluster analysis can be carried out in a space of any dimension and remains viable for high-dimensional data. By design, the procedure does not require the distributions of the data or of their interpoint distances to be known; it assumes only that the interpoint distances possess a density function. A data study demonstrates its effectiveness and its superiority over widely used clustering algorithms.
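
To make the central idea concrete, the following is a minimal sketch of how a univariate kernel density estimate over interpoint distances might yield a density score for each data member. It is not the algorithm from the paper, whose details are not given in this abstract. The Gaussian kernel, the normal-reference bandwidth rule, the choice to evaluate the estimate at distance zero, and the function names `euclidean`, `gaussian_kde`, and `density_around` are all assumptions made for illustration.

```python
import math
import statistics


def euclidean(a, b):
    """Euclidean interpoint distance; any other distance measure could be swapped in."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def gaussian_kde(samples, x, bandwidth):
    """Classical univariate Gaussian kernel density estimate evaluated at x."""
    norm = len(samples) * bandwidth * math.sqrt(2.0 * math.pi)
    return sum(math.exp(-0.5 * ((x - s) / bandwidth) ** 2) for s in samples) / norm


def density_around(points, i, distance=euclidean):
    """Density score for points[i]: KDE of its distances to all other points,
    evaluated at distance 0 (an illustrative assumption, not the paper's rule)."""
    dists = [distance(points[i], p) for j, p in enumerate(points) if j != i]
    # Normal-reference rule-of-thumb bandwidth (assumed here for simplicity).
    # The degenerate case of identical distances (zero spread) is not handled.
    h = 1.06 * statistics.stdev(dists) * len(dists) ** (-0.2)
    return gaussian_kde(dists, 0.0, h)


# Toy data: four nearby points and one far-away point.
points = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1), (5.0, 5.0)]
densities = [density_around(points, i) for i in range(len(points))]

# One hypothetical way to pick a starting representative: the highest score.
representative = max(range(len(points)), key=densities.__getitem__)
```

In this toy data set the four tightly grouped points receive higher scores than the isolated point, so a selection rule based on the highest score would start from inside the dense group. Because only interpoint distances enter the computation, the same sketch applies unchanged in any dimension once a suitable `distance` function is supplied.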


