Graph-based data clustering via multiscale community detection

09/06/2019
by   Zijing Liu, et al.
0

We present a graph-theoretical approach to data clustering, which combines the creation of a graph from the data with Markov Stability, a multiscale community detection framework. We show how the multiscale capabilities of the method allow the estimation of the number of clusters, as well as alleviating the sensitivity to the parameters in graph construction. We use both synthetic and benchmark real datasets to compare and evaluate several graph construction methods and clustering algorithms, and show that multiscale graph-based clustering achieves improved performance compared to popular clustering methods without the need to set externally the number of clusters.

READ FULL TEXT

page 5

page 7

page 10

research
03/08/2023

PyGenStability: Multiscale community detection with generalized Markov Stability

We present PyGenStability, a general-use Python software package that pr...
research
12/12/2021

Graph-based hierarchical record clustering for unsupervised entity resolution

Here we study the problem of matched record clustering in unsupervised e...
research
03/21/2023

Community detection in complex networks via node similarity, graph representation learning, and hierarchical clustering

Community detection is a critical challenge in the analysis of real-worl...
research
04/05/2018

Discovering Communities of Malapps on Android-based Mobile Cyber-physical Systems

Android-based devices like smartphones have become ideal mobile cyber-ph...
research
05/04/2019

Clustering-aware Graph Construction: A Joint Learning Perspective

As a promising clustering method, graph-based clustering converts the in...
research
12/07/2021

A graph representation based on fluid diffusion model for multimodal data analysis: theoretical aspects and enhanced community detection

Representing data by means of graph structures identifies one of the mos...
research
01/14/2014

A Boosting Approach to Learning Graph Representations

Learning the right graph representation from noisy, multisource data has...

Please sign up or login with your details

Forgot password? Click here to reset