Mode-Seeking Clustering and Density Ridge Estimation via Direct Estimation of Density-Derivative-Ratios

by   Hiroaki Sasaki, et al.

Modes and ridges of the probability density function behind observed data are useful geometric features. Mode-seeking clustering assigns cluster labels by associating data samples with the nearest modes, and estimation of density ridges enables us to find lower-dimensional structures hidden in data. A key technical challenge both in mode-seeking clustering and density ridge estimation is accurate estimation of the ratios of the first- and second-order density derivatives to the density. A naive approach takes a three-step approach of first estimating the data density, then computing its derivatives, and finally taking their ratios. However, this three-step approach can be unreliable because a good density estimator does not necessarily mean a good density derivative estimator, and division by the estimated density could significantly magnify the estimation error. To cope with these problems, we propose a novel estimator for the density-derivative-ratios. The proposed estimator does not involve density estimation, but rather directly approximates the ratios of density derivatives of any order. Moreover, we establish a convergence rate of the proposed estimator. Based on the proposed estimator, new methods both for mode-seeking clustering and density ridge estimation are developed, and corresponding convergence rates to the mode and ridge of the underlying density are also established. Finally, we experimentally demonstrate that the developed methods significantly outperform existing methods, particularly for relatively high-dimensional data.


page 1

page 2

page 3

page 4


Direct Density-Derivative Estimation and Its Application in KL-Divergence Approximation

Estimation of density derivatives is a versatile tool in statistical dat...

Uniform Convergence Rate of the Kernel Density Estimator Adaptive to Intrinsic Dimension

We derive concentration inequalities for the supremum norm of the differ...

Density Estimation via Adaptive Partition and Discrepancy Control

Given iid samples from some unknown continuous density on hyper-rectangl...

Regularized Multi-Task Learning for Multi-Dimensional Log-Density Gradient Estimation

Log-density gradient estimation is a fundamental statistical problem and...

Robust modal regression with direct log-density derivative estimation

Modal regression is aimed at estimating the global mode (i.e., global ma...

Density Estimation via Discrepancy

Given i.i.d samples from some unknown continuous density on hyper-rectan...

Density-Difference Estimation

We address the problem of estimating the difference between two probabil...

Please sign up or login with your details

Forgot password? Click here to reset