A Model-Based Approach to Rounding in Spectral Clustering

10/16/2012
by Leonard K. M. Poon, et al.

In spectral clustering, one defines a similarity matrix for a collection of data points, transforms the matrix into a Laplacian matrix, computes the eigenvectors of the Laplacian, and partitions the data using the leading eigenvectors. The last step is sometimes referred to as rounding, and it involves three subproblems: deciding how many leading eigenvectors to use, determining the number of clusters, and partitioning the data points. In this paper, we propose a novel method for rounding that differs from previous methods in three ways. First, we relax the assumption that the number of clusters equals the number of eigenvectors used. Second, when deciding how many leading eigenvectors to use, we rely not only on information contained in the leading eigenvectors themselves but also on subsequent eigenvectors. Third, our method is model-based and solves all three subproblems of rounding using a class of graphical models called latent tree models. We evaluate our method on both synthetic and real-world data. The results show that our method works correctly in the ideal case where between-cluster similarity is 0, and degrades gracefully as one moves away from the ideal case.
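
To make the "ideal case" concrete, the following is a minimal sketch, not the paper's latent-tree method. It assumes the unnormalized Laplacian L = D - W (the abstract does not say which transform is used) and a hypothetical 5-point similarity matrix whose between-cluster similarities are exactly 0. Under those assumptions, each cluster's indicator vector satisfies L v = 0, which is why the leading eigenvectors carry the partition and rounding is straightforward in this setting.

```python
# Hypothetical toy data: 5 points in two groups, {0, 1, 2} and {3, 4}.
# Similarity is positive within a group and exactly 0 between groups.
W = [
    [0.0, 0.9, 0.8, 0.0, 0.0],
    [0.9, 0.0, 0.7, 0.0, 0.0],
    [0.8, 0.7, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0, 0.6],
    [0.0, 0.0, 0.0, 0.6, 0.0],
]


def laplacian(W):
    """Unnormalized graph Laplacian L = D - W (an assumed choice of transform)."""
    n = len(W)
    degrees = [sum(row) for row in W]
    return [
        [(degrees[i] if i == j else 0.0) - W[i][j] for j in range(n)]
        for i in range(n)
    ]


def matvec(M, v):
    """Multiply matrix M by vector v."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]


def connected_components(W):
    """Label each point by its connected component (edge where W[i][j] > 0)."""
    n = len(W)
    labels = [-1] * n
    current = 0
    for start in range(n):
        if labels[start] != -1:
            continue
        labels[start] = current
        stack = [start]
        while stack:
            i = stack.pop()
            for j in range(n):
                if W[i][j] > 0 and labels[j] == -1:
                    labels[j] = current
                    stack.append(j)
        current += 1
    return labels


L = laplacian(W)
labels = connected_components(W)
num_clusters = max(labels) + 1

# One 0/1 indicator vector per cluster.
indicators = [
    [1.0 if lab == c else 0.0 for lab in labels] for c in range(num_clusters)
]

# Largest absolute entry of L @ indicator; near zero up to floating-point error.
residuals = [max(abs(x) for x in matvec(L, ind)) for ind in indicators]

print("labels:", labels)
print("number of clusters:", num_clusters)
print("residuals:", residuals)
```

Once between-cluster similarities become nonzero, the indicator vectors are no longer exact eigenvectors, and the three rounding subproblems named in the abstract have to be solved from noisy eigenvectors instead.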

research
11/09/2020

Dual regularized Laplacian spectral clustering methods on community detection

Spectral clustering methods are widely used for detecting clusters in ne...
research
12/23/2015

Incremental Method for Spectral Clustering of Increasing Orders

The smallest eigenvalues and the associated eigenvectors (i.e., eigenpai...
research
04/07/2017

Fast Spectral Clustering Using Autoencoders and Landmarks

In this paper, we introduce an algorithm for performing spectral cluster...
research
04/24/2019

Construction of the similarity matrix for the spectral clustering method: numerical experiments

Spectral clustering is a powerful method for finding structure in a data...
research
11/01/2019

Regularized Non-negative Spectral Embedding for Clustering

Spectral Clustering is a popular technique to split data points into gro...
research
06/25/2007

Separating populations with wide data: A spectral analysis

In this paper, we consider the problem of partitioning a small data samp...
research
07/23/2008

Data spectroscopy: Eigenspaces of convolution operators and clustering

This paper focuses on obtaining clustering information about a distribut...
