Efficient Parameter-free Clustering Using First Neighbor Relations

02/28/2019
by M. Saquib Sarfraz, et al.

We present a new clustering method in the form of a single clustering equation that is able to directly discover groupings in the data. The main proposition is that the first neighbor of each sample is all one needs to discover large chains and find the groups in the data. In contrast to most existing clustering algorithms, our method does not require hyper-parameters or distance thresholds, nor does it need the number of clusters to be specified. The proposed algorithm belongs to the family of hierarchical agglomerative methods. The technique has a very low computational overhead, is easily scalable, and is applicable to large practical problems. Evaluation on well-known datasets from different domains, ranging from 1077 to 8.1 million samples, shows substantial performance gains compared to existing clustering techniques.
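
To make the first-neighbor idea concrete, the following is a minimal sketch of a single partitioning step, not the authors' implementation. It assumes Euclidean distance, a brute-force neighbor search, and that two samples belong to the same group whenever a chain of first-neighbor links connects them. The function name `first_neighbor_clusters` is hypothetical, and the abstract does not spell out the clustering equation itself.

```python
import numpy as np


def first_neighbor_clusters(points):
    """Group samples by following first-neighbor links (illustrative sketch).

    Assumption: an undirected link joins each sample to its nearest other
    sample, and each connected chain of links forms one cluster.
    """
    x = np.asarray(points, dtype=float)
    n = len(x)

    # Brute-force pairwise squared Euclidean distances, O(n^2) memory.
    d = ((x[:, None, :] - x[None, :, :]) ** 2).sum(axis=-1)
    np.fill_diagonal(d, np.inf)  # a sample is not its own neighbor
    first_neighbor = d.argmin(axis=1)

    # Union-find over the first-neighbor links.
    parent = list(range(n))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i, j in enumerate(first_neighbor):
        ri, rj = find(i), find(int(j))
        if ri != rj:
            parent[rj] = ri

    # Relabel roots as 0..k-1 in order of first appearance.
    label_of = {}
    labels = [label_of.setdefault(find(i), len(label_of)) for i in range(n)]
    return np.array(labels)


if __name__ == "__main__":
    data = [[0, 0], [0, 1], [1, 0], [10, 10], [10, 11], [11, 10]]
    print(first_neighbor_clusters(data))
```

No cluster count or distance threshold is passed in: the number of groups falls out of the neighbor links alone. The quadratic distance matrix is used only for brevity and would not scale to the dataset sizes mentioned above.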

research
09/26/2017

Adaptive Nonparametric Clustering

This paper presents a new approach to non-parametric cluster analysis ca...
research
06/08/2021

ParChain: A Framework for Parallel Hierarchical Agglomerative Clustering using Nearest-Neighbor Chain

This paper studies the hierarchical clustering problem, where the goal i...
research
03/14/2017

A Random Finite Set Model for Data Clustering

The goal of data clustering is to partition data points into groups to m...
research
04/08/2019

CRAD: Clustering with Robust Autocuts and Depth

We develop a new density-based clustering algorithm named CRAD which is ...
research
04/24/2014

Solution Path Clustering with Adaptive Concave Penalty

Fast accumulation of large amounts of complex data has created a need fo...
research
04/28/2021

A Deep Learning Object Detection Method for an Efficient Clusters Initialization

Clustering is an unsupervised machine learning method grouping data samp...
research
01/29/2015

Bayesian Hierarchical Clustering with Exponential Family: Small-Variance Asymptotics and Reducibility

Bayesian hierarchical clustering (BHC) is an agglomerative clustering me...
