Data Clustering and Graph Partitioning via Simulated Mixing

03/15/2016
by Shahzad Bhatti, et al.

Spectral clustering approaches have led to well-accepted algorithms for finding accurate clusters in a given dataset. However, their application to large-scale datasets has been hindered by the computational complexity of eigenvalue decompositions. Several algorithms have recently been proposed to accelerate spectral clustering, but they trade away some of its accuracy to gain speed. In this paper, we propose a novel spectral clustering algorithm based on a mixing process on a graph. Unlike existing spectral clustering algorithms, our algorithm does not require computing eigenvectors. Specifically, it finds the equivalent of a linear combination of eigenvectors of the normalized similarity matrix, weighted by the corresponding eigenvalues. This linear combination is then used to partition the dataset into meaningful clusters. Simulations on real datasets show that partitioning based on such linear combinations of eigenvectors achieves better accuracy than standard spectral clustering methods as the number of clusters increases. Our algorithm can easily be implemented in a distributed setting.
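
To illustrate the central claim, the following is a minimal sketch and not the authors' algorithm. It assumes a symmetric normalization S = D^{-1/2} W D^{-1/2} of a similarity matrix W, a hypothetical six-node toy graph, and a fixed number of mixing steps; the names `normalized_similarity` and `simulate_mixing` are invented for this example. Repeatedly multiplying a starting vector by S yields the sum of the eigenvectors of S, each weighted by its eigenvalue raised to the number of steps and by its overlap with the starting vector, so no eigendecomposition is needed to obtain it. The eigendecomposition below is used only to check that identity. Projecting out the trivial eigenvector (proportional to the square roots of the degrees) and splitting by sign is likewise an assumption made to obtain a two-way partition.

```python
import numpy as np

def normalized_similarity(W):
    """Symmetric normalization D^{-1/2} W D^{-1/2} of a similarity matrix W."""
    inv_sqrt_deg = 1.0 / np.sqrt(W.sum(axis=1))
    return W * inv_sqrt_deg[:, None] * inv_sqrt_deg[None, :]

def simulate_mixing(S, x0, steps):
    """Apply S to x0 `steps` times using only matrix-vector products."""
    x = x0.copy()
    for _ in range(steps):
        x = S @ x
    return x

# Hypothetical toy graph: two triangles joined by one weak edge (weight 0.1).
W = np.array([
    [0, 1, 1,   0,   0, 0],
    [1, 0, 1,   0,   0, 0],
    [1, 1, 0,   0.1, 0, 0],
    [0, 0, 0.1, 0,   1, 1],
    [0, 0, 0,   1,   0, 1],
    [0, 0, 0,   1,   1, 0],
], dtype=float)
S = normalized_similarity(W)
steps = 20  # assumed number of mixing steps

# Start with all mass on node 0.
x0 = np.zeros(6)
x0[0] = 1.0
x_mixed = simulate_mixing(S, x0, steps)

# Check only: the same vector written as an eigenvalue-weighted
# combination of eigenvectors of S.
eigvals, eigvecs = np.linalg.eigh(S)
x_spectral = eigvecs @ (eigvals**steps * (eigvecs.T @ x0))
identity_holds = np.allclose(x_mixed, x_spectral)

# The top eigenvector of S is known in closed form (sqrt of degrees),
# so it can be removed without any eigendecomposition.
v1 = np.sqrt(W.sum(axis=1))
v1 /= np.linalg.norm(v1)
x_centered = x0 - (v1 @ x0) * v1
labels = simulate_mixing(S, x_centered, steps) > 0  # sign split into two groups
```

On this toy graph the sign split separates the two triangles, because after 20 steps the mixed vector is dominated by the eigenvector with the second-largest eigenvalue. How many steps to run, and how to extend the split beyond two clusters, are design choices this sketch does not address.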


