Nonparametric Nearest Neighbor Random Process Clustering

04/20/2015
by   Michael Tschannen, et al.
0

We consider the problem of clustering noisy finite-length observations of stationary ergodic random processes according to their nonparametric generative models without prior knowledge of the model statistics and the number of generative models. Two algorithms, both using the L1-distance between estimated power spectral densities (PSDs) as a measure of dissimilarity, are analyzed. The first algorithm, termed nearest neighbor process clustering (NNPC), to the best of our knowledge, is new and relies on partitioning the nearest neighbor graph of the observations via spectral clustering. The second algorithm, simply referred to as k-means (KM), consists of a single k-means iteration with farthest point initialization and was considered before in the literature, albeit with a different measure of dissimilarity and with asymptotic performance results only. We show that both NNPC and KM succeed with high probability under noise and even when the generative process PSDs overlap significantly, all provided that the observation length is sufficiently large. Our results quantify the tradeoff between the overlap of the generative process PSDs, the noise variance, and the observation length. Finally, we present numerical performance results for synthetic and real data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/22/2023

Refining a k-nearest neighbor graph for a computationally efficient spectral clustering

Spectral clustering became a popular choice for data clustering for its ...
research
10/27/2021

Nearest neighbor process: weak convergence and non-asymptotic bound

An empirical measure that results from the nearest neighbors to a given ...
research
11/15/2021

Spectral learning of multivariate extremes

We propose a spectral clustering algorithm for analyzing the dependence ...
research
02/17/2015

Nonparametric Nearest Neighbor Descent Clustering based on Delaunay Triangulation

In our physically inspired in-tree (IT) based clustering algorithm and t...
research
01/29/2015

Bayesian Hierarchical Clustering with Exponential Family: Small-Variance Asymptotics and Reducibility

Bayesian hierarchical clustering (BHC) is an agglomerative clustering me...
research
02/08/2022

Systematically improving existing k-means initialization algorithms at nearly no cost, by pairwise-nearest-neighbor smoothing

We present a meta-method for initializing (seeding) the k-means clusteri...
research
07/30/2020

kth Distance Distributions of n-Dimensional Matérn Cluster Process

In this letter, we derive the CDF (cumulative distribution function) of ...

Please sign up or login with your details

Forgot password? Click here to reset