Clustering via Mode Seeking by Direct Estimation of the Gradient of a Log-Density

04/20/2014
by   Hiroaki Sasaki, et al.
0

Mean shift clustering finds the modes of the data probability density by identifying the zero points of the density gradient. Since it does not require to fix the number of clusters in advance, the mean shift has been a popular clustering algorithm in various application fields. A typical implementation of the mean shift is to first estimate the density by kernel density estimation and then compute its gradient. However, since good density estimation does not necessarily imply accurate estimation of the density gradient, such an indirect two-step approach is not reliable. In this paper, we propose a method to directly estimate the gradient of the log-density without going through density estimation. The proposed method gives the global solution analytically and thus is computationally efficient. We then develop a mean-shift-like fixed-point algorithm to find the modes of the density for clustering. As in the mean shift, one does not need to set the number of clusters in advance. We empirically show that the proposed clustering method works much better than the mean shift especially for high-dimensional data. Experimental results further indicate that the proposed method outperforms existing clustering methods.

READ FULL TEXT
research
08/01/2015

Regularized Multi-Task Learning for Multi-Dimensional Log-Density Gradient Estimation

Log-density gradient estimation is a fundamental statistical problem and...
research
05/10/2018

Analysis of a Mode Clustering Diagram

Mode-based clustering methods define clusters to be the basins of attrac...
research
04/01/2021

MeanShift++: Extremely Fast Mode-Seeking With Applications to Segmentation and Object Tracking

MeanShift is a popular mode-seeking clustering algorithm used in a wide ...
research
04/14/2015

Clustering Assisted Fundamental Matrix Estimation

In computer vision, the estimation of the fundamental matrix is a basic ...
research
12/16/2019

Finding the Mode of a Kernel Density Estimate

Given points p_1, ..., p_n in R^d, how do we find a point x which maximi...
research
02/11/2019

A Distributed and Approximated Nearest Neighbors Algorithm for an Efficient Large Scale Mean Shift Clustering

In this paper we target the class of modal clustering methods where clus...
research
10/31/2018

Scalable Laplacian K-modes

We advocate Laplacian K-modes for joint clustering and density mode find...

Please sign up or login with your details

Forgot password? Click here to reset