Stability of Density-Based Clustering

11/11/2010
by   Alessandro Rinaldo, et al.
0

High density clusters can be characterized by the connected components of a level set L(λ) = {x: p(x)>λ} of the underlying probability density function p generating the data, at some appropriate level λ≥ 0. The complete hierarchical clustering can be characterized by a cluster tree T= _λ L(λ). In this paper, we study the behavior of a density level set estimate L(λ) and cluster tree estimate T based on a kernel density estimator with kernel bandwidth h. We define two notions of instability to measure the variability of L(λ) and T as a function of h, and investigate the theoretical properties of these instability measures.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/17/2017

Adaptive Clustering Using Kernel Density Estimators

We investigate statistical properties of a clustering algorithm that rec...
research
09/21/2016

On Data-Independent Properties for Density-Based Dissimilarity Measures in Hybrid Clustering

Hybrid clustering combines partitional and hierarchical clustering for c...
research
05/20/2016

Statistical Inference for Cluster Trees

A cluster tree provides a highly-interpretable summary of a density func...
research
08/15/2015

Towards an Axiomatic Approach to Hierarchical Clustering of Measures

We propose some axioms for hierarchical clustering of probability measur...
research
09/30/2014

Fully adaptive density-based clustering

The clusters of a distribution are often defined by the connected compon...
research
11/19/2021

An Asymptotic Equivalence between the Mean-Shift Algorithm and the Cluster Tree

Two important nonparametric approaches to clustering emerged in the 1970...
research
07/30/2013

DeBaCl: A Python Package for Interactive DEnsity-BAsed CLustering

The level set tree approach of Hartigan (1975) provides a probabilistica...

Please sign up or login with your details

Forgot password? Click here to reset