Scaling Analysis of Affinity Propagation

10/09/2009
by   Cyril Furtlehner, et al.
0

We analyze and exploit some scaling properties of the Affinity Propagation (AP) clustering algorithm proposed by Frey and Dueck (2007). First we observe that a divide and conquer strategy, used on a large data set hierarchically reduces the complexity O(N^2) to O(N^(h+2)/(h+1)), for a data-set of size N and a depth h of the hierarchical strategy. For a data-set embedded in a d-dimensional space, we show that this is obtained without notably damaging the precision except in dimension d=2. In fact, for d larger than 2 the relative loss in precision scales like N^(2-d)/(h+1)d. Finally, under some conditions we observe that there is a value s^* of the penalty coefficient, a free parameter used to fix the number of clusters, which separates a fragmentation phase (for s<s^*) from a coalescent one (for s>s^*) of the underlying hidden cluster structure. At this precise point holds a self-similarity property which can be exploited by the hierarchical strategy to actually locate its position. From this observation, a strategy based on can be defined to find out how many clusters are present in a given dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/18/2015

A Generalized Affinity Propagation Clustering Algorithm for Nonspherical Cluster Discovery

Clustering analysis aims to discover the underlying clusters in the data...
research
10/01/2022

A new nonparametric interpoint distance-based measure for assessment of clustering

A new interpoint distance-based measure is proposed to identify the opti...
research
10/09/2009

Local and global approaches of affinity propagation clustering for large scale data

Recently a new clustering algorithm called 'affinity propagation' (AP) h...
research
01/06/2022

A new measure for assessment of clustering based on kernel density estimation

A new clustering accuracy measure is proposed to determine the unknown n...
research
09/09/2021

Compositional Affinity Propagation: When Clusters Have Compositional Structure

We consider a new kind of clustering problem in which clusters need not ...
research
03/06/2021

The Automatic Quasi-clique Merger algorithm (AQCM)

The Automatic Quasi-clique Merger algorithm is a new algorithm adapted f...
research
03/26/2021

Geometric Affinity Propagation for Clustering with Network Knowledge

Clustering data into meaningful subsets is a major task in scientific da...

Please sign up or login with your details

Forgot password? Click here to reset