Explainable Graph Spectral Clustering of Text Documents

08/01/2023
by   Bartłomiej Starosta, et al.
0

Spectral clustering methods are known for their ability to represent clusters of diverse shapes, densities etc. However, results of such algorithms, when applied e.g. to text documents, are hard to explain to the user, especially due to embedding in the spectral space which has no obvious relation to document contents. Therefore there is an urgent need to elaborate methods for explaining the outcome of the clustering. This paper presents a contribution towards this goal. We present a proposal of explanation of results of combinatorial Laplacian based graph spectral clustering. It is based on showing (approximate) equivalence of combinatorial Laplacian embedding, K-embedding (proposed in this paper) and term vector space embedding. Hence a bridge is constructed between the textual contents and the clustering results. We provide theoretical background for this approach. We performed experimental study showing that K-embedding approximates well Laplacian embedding under favourable block matrix conditions and show that approximation is good enough under other conditions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/12/2017

Spectral Clustering via Graph Filtering: Consistency on the High-Dimensional Stochastic Block Model

Spectral clustering is amongst the most popular methods for community de...
research
07/23/2021

The decomposition of the higher-order homology embedding constructed from the k-Laplacian

The null space of the k-th order Laplacian ℒ_k, known as the k-th homolo...
research
06/14/2023

Multi-class Graph Clustering via Approximated Effective p-Resistance

This paper develops an approximation to the (effective) p-resistance and...
research
06/10/2014

Graph Approximation and Clustering on a Budget

We consider the problem of learning from a similarity matrix (such as sp...
research
04/05/2019

Simultaneous Dimensionality and Complexity Model Selection for Spectral Graph Clustering

Our problem of interest is to cluster vertices of a graph by identifying...
research
10/25/2018

Spectral Embedding Norm: Looking Deep into the Spectrum of the Graph Laplacian

The extraction of clusters from a dataset which includes multiple cluste...
research
02/22/2023

Approximate spectral clustering with eigenvector selection and self-tuned k

The recently emerged spectral clustering surpasses conventional clusteri...

Please sign up or login with your details

Forgot password? Click here to reset