Border-Peeling Clustering

12/14/2016
by   Nadav Bar, et al.
0

In this paper, we present a novel non-parametric clustering technique, which is based on an iterative algorithm that peels off layers of points around the clusters. Our technique is based on the notion that each latent cluster is comprised of layers that surround its core, where the external layers, or border points, implicitly separate the clusters. Analyzing the K-nearest neighbors of the points makes it possible to identify the border points and associate them with points of inner layers. Our clustering algorithm iteratively identifies border points, peels them, and separates the latent clusters. We show that the peeling process adapts to the local density and successfully separates adjacent clusters. A notable quality of the Border-Peeling algorithm is that it does not require any parameter tuning in order to outperform state-of-the-art finely-tuned non-parametric clustering methods, including Mean-Shift and DBSCAN. We further assess our technique on high-dimensional datasets that vary in size and characteristics. In particular, we analyze the space of deep features that were trained by a convolutional neural network.

READ FULL TEXT
research
10/05/2016

Non-Parametric Cluster Significance Testing with Reference to a Unimodal Null Distribution

Cluster analysis is an unsupervised learning strategy that can be employ...
research
09/26/2017

Adaptive Nonparametric Clustering

This paper presents a new approach to non-parametric cluster analysis ca...
research
03/22/2018

Clustering-driven Deep Embedding with Pairwise Constraints

Recently, there has been increasing interest to leverage the competence ...
research
08/27/2022

Geometrical Homogeneous Clustering for Image Data Reduction

In this paper, we present novel variations of an earlier approach called...
research
05/29/2023

DMS: Differentiable Mean Shift for Dataset Agnostic Task Specific Clustering Using Side Information

We present a novel approach, in which we learn to cluster data directly ...
research
09/14/2019

A highly likely clusterable data model with no clusters

We propose a model for a dataset in R^D that does not contain any clust...
research
03/08/2016

A Bayesian non-parametric method for clustering high-dimensional binary data

In many real life problems, objects are described by large number of bin...

Please sign up or login with your details

Forgot password? Click here to reset