Dissecting graph measure performance for node clustering in LFR parameter space

02/20/2022
by   Vladimir Ivashkin, et al.
0

Graph measures that express closeness or distance between nodes can be employed for graph nodes clustering using metric clustering algorithms. There are numerous measures applicable to this task, and which one performs better is an open question. We study the performance of 25 graph measures on generated graphs with different parameters. While usually measure comparisons are limited to general measure ranking on a particular dataset, we aim to explore the performance of various measures depending on graph features. Using an LFR graph generator, we create a dataset of 11780 graphs covering the whole LFR parameter space. For each graph, we assess the quality of clustering with k-means algorithm for each considered measure. Based on this, we determine the best measure for each area of the parameter space. We find that the parameter space consists of distinct zones where one particular measure is the best. We analyze the geometry of the resulting zones and describe it with simple criteria. Given particular graph parameters, this allows us to recommend a particular measure to use for clustering.

READ FULL TEXT

page 9

page 11

research
11/24/2016

Comparative study of histogram distance measures for re-identification

Color based re-identification methods usually rely on a distance functio...
research
05/10/2020

PageRank and The K-Means Clustering Algorithm

We introduce a graph clustering algorithm that generalizes k-means to gr...
research
03/02/2020

How to choose the most appropriate centrality measure?

We propose a new method to select the most appropriate network centralit...
research
09/07/2010

Optimizing an Organized Modularity Measure for Topographic Graph Clustering: a Deterministic Annealing Approach

This paper proposes an organized generalization of Newman and Girvan's m...
research
03/14/2021

Pandemonium: a clustering tool to partition parameter space – application to the B anomalies

We introduce the interactive tool pandemonium to cluster model predictio...
research
12/18/2018

cellPACKexplorer: Interactive Model Building for Volumetric Data of Complex Cells

Given an algorithm the quality of the output largely depends on a proper...

Please sign up or login with your details

Forgot password? Click here to reset