Initialization Free Graph Based Clustering

09/24/2009
by   Laurent Galluccio, et al.
0

This paper proposes an original approach to cluster multi-component data sets, including an estimation of the number of clusters. From the construction of a minimal spanning tree with Prim's algorithm, and the assumption that the vertices are approximately distributed according to a Poisson distribution, the number of clusters is estimated by thresholding the Prim's trajectory. The corresponding cluster centroids are then computed in order to initialize the generalized Lloyd's algorithm, also known as K-means, which allows to circumvent initialization problems. Some results are derived for evaluating the false positive rate of our cluster detection algorithm, with the help of approximations relevant in Euclidean spaces. Metrics used for measuring similarity between multi-dimensional data points are based on symmetrical divergences. The use of these informational divergences together with the proposed method leads to better results, compared to other clustering methods for the problem of astrophysical data processing. Some applications of this method in the multi/hyper-spectral imagery domain to a satellite view of Paris and to an image of the Mars planet are also presented. In order to demonstrate the usefulness of divergences in our problem, the method with informational divergence as similarity measure is compared with the same method using classical metrics. In the astrophysics application, we also compare the method with the spectral clustering algorithms.

READ FULL TEXT

page 12

page 15

research
04/24/2019

Construction of the similarity matrix for the spectral clustering method: numerical experiments

Spectral clustering is a powerful method for finding structure in a data...
research
02/01/2022

Spectral Clustering, Spanning Forest, and Bayesian Forest Process

Spectral clustering algorithms are very popular. Starting from a pairwis...
research
11/12/2017

Unified Spectral Clustering with Optimal Graph

Spectral clustering has found extensive use in many areas. Most traditio...
research
02/25/2023

A parameter-free graph reduction for spectral clustering and SpectralNet

Graph-based clustering methods like spectral clustering and SpectralNet ...
research
08/15/2023

Parametric entropy based Cluster Centriod Initialization for k-means clustering of various Image datasets

One of the most employed yet simple algorithm for cluster analysis is th...
research
04/25/2016

Weighted Spectral Cluster Ensemble

Clustering explores meaningful patterns in the non-labeled data sets. Cl...
research
05/29/2022

An adaptive granularity clustering method based on hyper-ball

The purpose of cluster analysis is to classify elements according to the...

Please sign up or login with your details

Forgot password? Click here to reset