A Parameter-free Affinity Based Clustering

07/20/2015
by   Bhaskar Mukhoty, et al.
0

Several methods have been proposed to estimate the number of clusters in a dataset; the basic ideal behind all of them has been to study an index that measures inter-cluster separation and intra-cluster cohesion over a range of cluster numbers and report the number which gives an optimum value of the index. In this paper we propose a simple, parameter free approach that is like human cognition to form clusters, where closely lying points are easily identified to form a cluster and total number of clusters are revealed. To identify closely lying points, affinity of two points is defined as a function of distance and a threshold affinity is identified, above which two points in a dataset are likely to be in the same cluster. Well separated clusters are identified even in the presence of outliers, whereas for not so well separated dataset, final number of clusters are estimated and the detected clusters are merged to produce the final clusters. Experiments performed with several large dimensional synthetic and real datasets show good results with robustness to noise and density variation within dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/20/2018

Cluster validity index based on Jeffrey divergence

Cluster validity indexes are very important tools designed for two purpo...
research
12/29/2021

VDPC: Variational Density Peak Clustering Algorithm

The widely applied density peak clustering (DPC) algorithm makes an intu...
research
06/09/2021

On Clusters that are Separated but Large

Given a set P of n points in ^d, consider the problem of computing k sub...
research
12/02/2019

Identifying the number of clusters for K-Means: A hypersphere density based approach

Application of K-Means algorithm is restricted by the fact that the numb...
research
07/27/2020

Modeling the Influence of Visual Density on Cluster Perception in Scatterplots Using Topology

Scatterplots are used for a variety of visual analytics tasks, including...
research
01/12/2017

Light Source Point Cluster Selection Based Atmosphere Light Estimation

Atmosphere light value is a highly critical parameter in defogging algor...
research
11/15/2019

Penalized k-means algorithms for finding the correct number of clusters in a dataset

In many applications we want to find the number of clusters in a dataset...

Please sign up or login with your details

Forgot password? Click here to reset