Hierarchical Graph Clustering using Node Pair Sampling

06/05/2018
by   Thomas Bonald, et al.
0

We present a novel hierarchical graph clustering algorithm inspired by modularity-based clustering techniques. The algorithm is agglomerative and based on a simple distance between clusters induced by the probability of sampling node pairs. We prove that this distance is reducible, which enables the use of the nearest-neighbor chain to speed up the agglomeration. The output of the algorithm is a regular dendrogram, which reveals the multi-scale structure of the graph. The results are illustrated on both synthetic and real datasets.

READ FULL TEXT

page 8

page 9

research
03/15/2022

Natural Hierarchical Cluster Analysis by Nearest Neighbors with Near-Linear Time Complexity

We propose a nearest neighbor based clustering algorithm that results in...
research
01/29/2015

Bayesian Hierarchical Clustering with Exponential Family: Small-Variance Asymptotics and Reducibility

Bayesian hierarchical clustering (BHC) is an agglomerative clustering me...
research
07/13/2018

Learning Graph Representations by Dendrograms

Hierarchical graph clustering is a common technique to reveal the multi-...
research
09/22/2016

Realtime Hierarchical Clustering based on Boundary and Surface Statistics

Visual grouping is a key mechanism in human scene perception. There, it ...
research
08/07/2023

TeraHAC: Hierarchical Agglomerative Clustering of Trillion-Edge Graphs

We introduce TeraHAC, a (1+ϵ)-approximate hierarchical agglomerative clu...
research
12/12/2021

Graph-based hierarchical record clustering for unsupervised entity resolution

Here we study the problem of matched record clustering in unsupervised e...
research
04/20/2011

Fast redshift clustering with the Baire (ultra) metric

The Baire metric induces an ultrametric on a dataset and is of linear co...

Please sign up or login with your details

Forgot password? Click here to reset