On Convergence of Epanechnikov Mean Shift

11/20/2017
by   Kejun Huang, et al.
0

Epanechnikov Mean Shift is a simple yet empirically very effective algorithm for clustering. It localizes the centroids of data clusters via estimating modes of the probability distribution that generates the data points, using the `optimal' Epanechnikov kernel density estimator. However, since the procedure involves non-smooth kernel density functions, the convergence behavior of Epanechnikov mean shift lacks theoretical support as of this writing---most of the existing analyses are based on smooth functions and thus cannot be applied to Epanechnikov Mean Shift. In this work, we first show that the original Epanechnikov Mean Shift may indeed terminate at a non-critical point, due to the non-smoothness nature. Based on our analysis, we propose a simple remedy to fix it. The modified Epanechnikov Mean Shift is guaranteed to terminate at a local maximum of the estimated density, which corresponds to a cluster centroid, within a finite number of iterations. We also propose a way to avoid running the Mean Shift iterates from every data point, while maintaining good clustering accuracies under non-overlapping spherical Gaussian mixture models. This further pushes Epanechnikov Mean Shift to handle very large and high-dimensional data sets. Experiments show surprisingly good performance compared to the Lloyd's K-means algorithm and the EM algorithm.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/20/2020

Automated Clustering of High-dimensional Data with a Feature Weighted Mean Shift Algorithm

Mean shift is a simple interactive procedure that gradually shifts data ...
research
03/02/2015

A review of mean-shift algorithms for clustering

A natural way to characterize the cluster structure of a dataset is by f...
research
01/07/2020

Generalized mean shift with triangular kernel profile

The mean shift algorithm is a popular way to find modes of some probabil...
research
04/20/2021

Space Partitioning and Regression Mode Seeking via a Mean-Shift-Inspired Algorithm

The mean shift (MS) algorithm is a nonparametric method used to cluster ...
research
08/06/2014

The functional mean-shift algorithm for mode hunting and clustering in infinite dimensions

We introduce the functional mean-shift algorithm, an iterative algorithm...
research
02/11/2019

A Distributed and Approximated Nearest Neighbors Algorithm for an Efficient Large Scale Mean Shift Clustering

In this paper we target the class of modal clustering methods where clus...
research
10/31/2018

Scalable Laplacian K-modes

We advocate Laplacian K-modes for joint clustering and density mode find...

Please sign up or login with your details

Forgot password? Click here to reset