On Robustness of Kernel Clustering

06/06/2016
by Bowei Yan, et al.

Clustering is one of the most important unsupervised problems in machine learning and statistics. Among the many existing algorithms, kernel k-means has drawn much research attention due to its ability to find non-linear cluster boundaries and its inherent simplicity. There are two main approaches to kernel k-means: SVD of the kernel matrix and convex relaxations. Despite the attention kernel clustering has received from both theoretical and applied quarters, not much is known about the robustness of the methods. In this paper we first introduce a semidefinite programming (SDP) relaxation for the kernel clustering problem, then prove that under a suitable model specification, both the K-SVD and SDP approaches are consistent in the limit. However, SDP is strongly consistent, i.e. it achieves exact recovery, whereas K-SVD is weakly consistent, i.e. the fraction of misclassified nodes vanishes.
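To make the spectral approach concrete, here is a minimal sketch of clustering via a decomposition of the kernel matrix: build a kernel matrix, take its top-k eigenvectors, and run k-means on the rows. It is an illustration under stated assumptions, not the paper's exact K-SVD procedure. The Gaussian kernel, the bandwidth `sigma`, the farthest-point initialization, and all function names are choices made here for the example.

```python
import numpy as np

def gaussian_kernel(X, sigma=1.0):
    """Gaussian kernel matrix (assumed kernel; sigma is a hypothetical bandwidth)."""
    sq = np.sum(X ** 2, axis=1)
    # Pairwise squared distances, clipped at 0 to absorb rounding error.
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * X @ X.T, 0.0)
    return np.exp(-d2 / (2.0 * sigma ** 2))

def lloyd_kmeans(Y, k, n_iter=100, seed=0):
    """Plain Lloyd iterations with farthest-point initialization."""
    rng = np.random.default_rng(seed)
    centers = [Y[rng.integers(len(Y))]]
    for _ in range(1, k):
        # Next center: the point farthest from all centers chosen so far.
        d = np.min([np.sum((Y - c) ** 2, axis=1) for c in centers], axis=0)
        centers.append(Y[np.argmax(d)])
    centers = np.array(centers)
    for _ in range(n_iter):
        dists = ((Y[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        # Recompute means; keep the old center if a cluster is empty.
        new_centers = np.array([
            Y[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return labels

def kernel_svd_clustering(X, k, sigma=1.0, seed=0):
    """Cluster by running k-means on the top-k eigenvectors of the kernel matrix."""
    K = gaussian_kernel(X, sigma)
    # K is symmetric, so eigh applies; eigenvalues are returned in ascending order.
    _, eigvecs = np.linalg.eigh(K)
    U = eigvecs[:, -k:]  # columns for the k largest eigenvalues
    return lloyd_kmeans(U, k, seed=seed)
```

Calling `kernel_svd_clustering(X, k=2, sigma=2.0)` on an `(n, d)` array returns one integer label per row. The SDP approach mentioned in the abstract would replace the eigenvector step with a convex program over the same kernel matrix and is not shown here.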


