Clustering Optimization: Finding the Number and Centroids of Clusters by a Fourier-based Algorithm

04/29/2019
by   Soheil Mehrabkhani, et al.
0

We propose a Fourier-based approach for optimization of several clustering algorithms. Mathematically, clusters data can be described by a density function represented by the Dirac mixture distribution. The density function can be smoothed by applying the Fourier transform and a Gaussian filter. The determination of the optimal standard deviation of the Gaussian filter will be accomplished by the use of a convergence criterion related to the correlation between the smoothed and the original density functions. In principle, the optimal smoothed density function exhibits local maxima, which correspond to the cluster centroids. Thus, the complex task of finding the centroids of the clusters is simplified by the detection of the peaks of the smoothed density function. A multiple sliding windows procedure is used to detect the peaks. The remarkable accuracy of the proposed algorithm demonstrates its capability as a reliable general method for enhancement of the clustering performance, its global optimization and also removing the initialization problem in many clustering methods.

READ FULL TEXT
research
06/24/2019

Density-based Clustering with Best-scored Random Forest

Single-level density-based approach has long been widely acknowledged to...
research
02/10/2020

A fast and efficient Modal EM algorithm for Gaussian mixtures

In the modal approach to clustering, clusters are defined as the local m...
research
12/26/2019

Parameter Free Clustering with Cluster Catch Digraphs (Technical Report)

We propose clustering algorithms based on a recently developed geometric...
research
10/05/2018

CDF Transform-Shift: An effective way to deal with inhomogeneous density datasets

Many distance-based algorithms exhibit bias towards dense clusters in in...
research
09/05/2023

Superclustering by finding statistically significant separable groups of optimal gaussian clusters

The paper presents the algorithm for clustering a dataset by grouping th...
research
08/09/2018

α-Approximation Density-based Clustering of Multi-valued Objects

Multi-valued data are commonly found in many real applications. During t...
research
05/16/2020

Revisiting Agglomerative Clustering

In data clustering, emphasis is often placed in finding groups of points...

Please sign up or login with your details

Forgot password? Click here to reset