Modal clustering asymptotics with applications to bandwidth selection

01/22/2019
by Alessandro Casa, et al.

Density-based clustering relies on the idea of linking groups to specific features of the probability distribution underlying the data. Reference to a true, yet unknown, population structure allows the clustering problem to be framed in a standard inferential setting, where the ideal population clustering is defined as the partition induced by the true density function. The nonparametric formulation of this approach, known as modal clustering, draws a correspondence between the groups and the domains of attraction of the density modes. Operationally, a nonparametric density estimate is required, and a proper selection of the amount of smoothing, which governs the shape of the density and hence possibly its modal structure, is crucial to identifying the final partition. In this work, we address the issue of density estimation for modal clustering from an asymptotic perspective. A natural and easy-to-interpret metric for measuring the distance between density-based partitions is discussed, and its asymptotic approximation is explored and then employed to study the problem of bandwidth selection for nonparametric modal clustering.
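To make the role of the bandwidth concrete, the following is a minimal sketch, not the procedure studied in the paper. It assumes a Gaussian kernel density estimate on one-dimensional toy data and uses a plain mean-shift iteration to send each observation to the mode whose domain of attraction it belongs to. The function names `mean_shift_1d` and `modal_partition`, the toy data, and the tolerance values are all illustrative assumptions.

```python
import numpy as np


def mean_shift_1d(data, h, tol=1e-8, max_iter=1000):
    """Move every observation uphill on the Gaussian KDE with bandwidth h."""
    points = data.astype(float).copy()
    for _ in range(max_iter):
        # Gaussian kernel weights between the current points and the data
        w = np.exp(-0.5 * ((points[:, None] - data[None, :]) / h) ** 2)
        # Mean-shift update: kernel-weighted average of the data
        shifted = (w @ data) / w.sum(axis=1)
        converged = np.max(np.abs(shifted - points)) < tol
        points = shifted
        if converged:
            break
    return points


def modal_partition(data, h, merge_tol=1e-3):
    """Label observations by the mode their mean-shift path converges to."""
    limits = mean_shift_1d(data, h)
    order = np.argsort(limits)
    # A new cluster starts wherever consecutive sorted limit points
    # are farther apart than merge_tol (an arbitrary illustrative threshold)
    jumps = np.diff(limits[order]) > merge_tol
    labels = np.empty(len(data), dtype=int)
    labels[order] = np.concatenate([[0], np.cumsum(jumps)])
    return labels


# Deterministic toy data: two well-separated groups (an assumption for illustration)
data = np.concatenate([np.linspace(-3.0, -1.0, 50), np.linspace(1.0, 3.0, 50)])

labels_small_h = modal_partition(data, h=0.5)
labels_large_h = modal_partition(data, h=3.0)
print(len(np.unique(labels_small_h)), len(np.unique(labels_large_h)))
```

On these toy data, the smaller bandwidth keeps the two groups apart, while the larger one oversmooths the estimate into a single mode and merges them. This is the sense in which the amount of smoothing determines the final partition.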

research
10/26/2020

Modal clustering of matrix-variate data

The nonparametric formulation of density-based clustering, known as moda...
research
12/04/2014

Nonparametric modal regression

Modal regression estimates the local modes of the distribution of Y give...
research
08/07/2021

Clustering Large Data Sets with Incremental Estimation of Low-density Separating Hyperplanes

An efficient method for obtaining low-density hyperplane separators in t...
research
11/15/2019

How bettering the best? Answers via blending models and cluster formulations in density-based clustering

With the recent growth in data availability and complexity, and the asso...
research
08/06/2014

A Population Background for Nonparametric Density-Based Clustering

Despite its popularity, it is widely recognized that the investigation o...
research
12/06/2012

Clusters and water flows: a novel approach to modal clustering through Morse theory

The problem of finding groups in data (cluster analysis) has been extens...
research
01/20/2021

Density-based clustering of social networks

The idea underlying the modal formulation of density-based clustering is...
