How the result of graph clustering methods depends on the construction of the graph

02/10/2011
by Markus Maier, et al.

We study the scenario of graph-based clustering algorithms such as spectral clustering. Given a set of data points, one first has to construct a graph on the data points and then apply a graph clustering algorithm to find a suitable partition of the graph. Our main question is whether and how the construction of the graph (choice of the graph, choice of parameters, choice of weights) influences the final clustering result. To this end we study the convergence of cluster quality measures such as the normalized cut or the Cheeger cut on various kinds of random geometric graphs as the sample size tends to infinity. It turns out that the limit values of the same objective function are systematically different on different types of graphs. This implies that clustering results systematically depend on the graph and can be very different for different types of graphs. We provide examples to illustrate the implications for spectral clustering.
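The dependence on graph construction can be illustrated with a small numerical sketch. This is not the authors' experimental setup: the two-blob data set, the split at x = 0, and the parameter values `k=10` and `eps=1.0` are assumptions chosen for illustration, and the helper names are hypothetical. The sketch builds an unweighted symmetric kNN graph and an unweighted epsilon-neighborhood graph on the same sample and evaluates the normalized cut of the same fixed partition on both.

```python
import numpy as np

def pairwise_distances(X):
    """Euclidean distance matrix between all rows of X."""
    diff = X[:, None, :] - X[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))

def knn_graph(D, k):
    """Unweighted symmetric kNN graph: i~j if either is among the other's k nearest."""
    n = D.shape[0]
    D_no_self = D.copy()
    np.fill_diagonal(D_no_self, np.inf)          # a point is not its own neighbor
    nearest = np.argsort(D_no_self, axis=1)[:, :k]
    W = np.zeros((n, n))
    W[np.repeat(np.arange(n), k), nearest.ravel()] = 1.0
    return np.maximum(W, W.T)                    # symmetrize

def eps_graph(D, eps):
    """Unweighted epsilon-neighborhood graph: i~j if their distance is at most eps."""
    W = (D <= eps).astype(float)
    np.fill_diagonal(W, 0.0)
    return W

def ncut(W, mask):
    """Normalized cut of the partition (mask, ~mask): cut * (1/vol(A) + 1/vol(B))."""
    cut = W[mask][:, ~mask].sum()
    vol_a = W[mask].sum()                        # sum of degrees in A
    vol_b = W[~mask].sum()                       # sum of degrees in B
    return cut * (1.0 / vol_a + 1.0 / vol_b)

# Assumed toy data: two Gaussian blobs in the plane.
rng = np.random.default_rng(0)
X = np.vstack([
    rng.normal(loc=[-2.0, 0.0], scale=1.0, size=(100, 2)),
    rng.normal(loc=[2.0, 0.0], scale=1.0, size=(100, 2)),
])
mask = X[:, 0] < 0.0                             # the same fixed partition for both graphs

D = pairwise_distances(X)
W_knn = knn_graph(D, k=10)
W_eps = eps_graph(D, eps=1.0)

print("Ncut on kNN graph:", ncut(W_knn, mask))
print("Ncut on eps graph:", ncut(W_eps, mask))
```

The two printed values are the same objective evaluated on the same partition of the same points; only the graph differs. A single finite sample like this only hints at the effect, since the abstract's statement concerns limit values as the sample size tends to infinity.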


