Kernel Spectral Clustering and applications

05/03/2015
by Rocco Langone, et al.

This chapter reviews the main literature on kernel spectral clustering (KSC), an approach to clustering cast in a kernel-based optimization setting. KSC is a least-squares support vector machine (LS-SVM) formulation of spectral clustering, described by a weighted kernel PCA objective. As in the classifier case, the binary clustering model is expressed by a hyperplane in a high-dimensional space induced by a kernel. Multi-way clustering is obtained by combining a set of binary decision functions via an Error-Correcting Output Codes (ECOC) encoding scheme. Because it is model-based, KSC comprises three main steps: training, validation, and testing. In the validation stage, model selection is performed to obtain tuning parameters such as the number of clusters present in the data. This is a major advantage over classical spectral clustering, where the determination of the clustering parameters is unclear and relies on heuristics. Once a KSC model is trained on a small subset of the data, it is able to generalize well to unseen test points. Beyond the basic formulation, sparse KSC algorithms based on the Incomplete Cholesky Decomposition (ICD) and on L_0, L_1, L_0 + L_1, and Group Lasso regularization are reviewed, showing how large-scale data can be handled. Two ways to perform hierarchical clustering and a soft clustering method are also presented. Finally, real-world applications such as image segmentation, power load time-series clustering, document clustering, and big data learning are considered.
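
The training and out-of-sample steps summarized above can be illustrated with a minimal NumPy sketch. It assumes the commonly cited KSC dual problem D^{-1} M_D Ω α = λ α, where Ω is the kernel matrix, D its degree matrix, and M_D a weighted centering matrix; none of these symbols appear in the abstract itself. The function names (`rbf_kernel`, `ksc_train`, `ksc_predict`), the RBF kernel choice, and the simplified codebook decoding by Hamming distance are likewise assumptions made for illustration, not the chapter's reference implementation. The dense eigendecomposition is used only for clarity and would not scale to large data.

```python
import numpy as np

def rbf_kernel(A, B, sigma):
    """RBF kernel matrix between rows of A and rows of B (assumed kernel choice)."""
    d2 = (A**2).sum(1)[:, None] + (B**2).sum(1)[None, :] - 2 * A @ B.T
    return np.exp(-np.maximum(d2, 0.0) / (2 * sigma**2))

def ksc_train(X, k, sigma):
    """Fit k-1 binary score functions on training data X (illustrative sketch)."""
    n = X.shape[0]
    Omega = rbf_kernel(X, X, sigma)
    d_inv = 1.0 / Omega.sum(axis=1)              # diagonal of D^{-1}
    # Weighted centering matrix M_D = I - 1 1^T D^{-1} / (1^T D^{-1} 1)
    M_D = np.eye(n) - np.outer(np.ones(n), d_inv) / d_inv.sum()
    # Assumed dual eigenproblem: D^{-1} M_D Omega alpha = lambda alpha
    eigvals, eigvecs = np.linalg.eig(d_inv[:, None] * (M_D @ Omega))
    top = np.argsort(-eigvals.real)[: k - 1]     # k-1 leading eigenvectors
    alpha = eigvecs[:, top].real
    # Bias terms that center the score variables
    b = -(d_inv @ Omega @ alpha) / d_inv.sum()
    # Sign-encode the training scores; keep the k most frequent codewords
    codes = np.sign(Omega @ alpha + b)
    uniq, counts = np.unique(codes, axis=0, return_counts=True)
    codebook = uniq[np.argsort(-counts)[:k]]
    return alpha, b, codebook

def ksc_predict(X_train, X_test, alpha, b, codebook, sigma):
    """Assign unseen points to the codeword at minimum Hamming distance."""
    scores = rbf_kernel(X_test, X_train, sigma) @ alpha + b
    codes = np.sign(scores)
    hamming = (codes[:, None, :] != codebook[None, :, :]).sum(axis=2)
    return hamming.argmin(axis=1)
```

In this sketch, `ksc_train` would be called on a small training subset, and `ksc_predict` then labels the remaining points without recomputing the eigendecomposition, which mirrors the out-of-sample property described in the abstract. The kernel bandwidth `sigma` and the number of clusters `k` are the tuning parameters that the validation stage would select.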


