Graph Distances and Clustering

04/06/2020
by Pierre Miasnikof, et al.

With a view to graph clustering, we present a definition of vertex-to-vertex distance based on shared connectivity. We argue that vertices sharing more connections are closer to each other than vertices sharing fewer connections. Our thesis rests on the widely accepted notion that strong clusters are characterized by high induced subgraph density, where each induced subgraph represents a cluster. We argue that these clusters are formed by grouping vertices that are similar in their connectivity. At the cluster level (induced subgraph level), our thesis translates into low mean intra-cluster distances. Our definition differs from the usual shortest-path geodesic distance. In this article, we compare three distance measures from the literature. Our benchmark is how accurately each measure reflects intra-cluster density when aggregated (averaged) at the cluster level. We conduct our tests on synthetic graphs generated using the planted partition model, where clusters and intra-cluster densities are known in advance. We examine correlations between mean intra-cluster distances and intra-cluster densities. Our numerical experiments show that the Jaccard and Otsuka-Ochiai distances, when averaged over vertex pairs within clusters, reflect density very accurately.
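To make the idea of distance through shared connectivity concrete, here is a minimal Python sketch of the kind of experiment the abstract describes. It is not the authors' code. It assumes that each vertex is represented by its closed neighbourhood (its neighbours plus itself), which may differ from the exact definition used in the paper, and the function names (`planted_partition`, `cluster_summary`, and so on) are hypothetical. The sketch generates a small planted partition graph, then reports the density and the mean intra-cluster distance of each cluster.

```python
import itertools
import math
import random


def planted_partition(num_clusters, cluster_size, p_in, p_out, seed=0):
    """Random graph with known clusters: edge probability p_in inside a
    cluster and p_out between clusters. Returns (adjacency sets, labels)."""
    rng = random.Random(seed)
    n = num_clusters * cluster_size
    labels = [v // cluster_size for v in range(n)]
    adj = {v: set() for v in range(n)}
    for u, v in itertools.combinations(range(n), 2):
        p = p_in if labels[u] == labels[v] else p_out
        if rng.random() < p:
            adj[u].add(v)
            adj[v].add(u)
    return adj, labels


def closed_neighbourhood(adj, v):
    # Assumption: a vertex counts as its own neighbour, so the set is never empty.
    return adj[v] | {v}


def jaccard_distance(a, b):
    """1 - |A ∩ B| / |A ∪ B| for two non-empty sets."""
    return 1 - len(a & b) / len(a | b)


def otsuka_ochiai_distance(a, b):
    """1 - |A ∩ B| / sqrt(|A| * |B|) for two non-empty sets."""
    return 1 - len(a & b) / math.sqrt(len(a) * len(b))


def cluster_summary(adj, labels, distance):
    """Map each cluster label to (intra-cluster density, mean intra-cluster
    distance). Every cluster must contain at least two vertices."""
    clusters = {}
    for v, c in enumerate(labels):
        clusters.setdefault(c, []).append(v)
    summary = {}
    for c, members in clusters.items():
        pairs = list(itertools.combinations(members, 2))
        edges = sum(1 for u, v in pairs if v in adj[u])
        total = sum(
            distance(closed_neighbourhood(adj, u), closed_neighbourhood(adj, v))
            for u, v in pairs
        )
        summary[c] = (edges / len(pairs), total / len(pairs))
    return summary


if __name__ == "__main__":
    # Illustrative parameters only; they are not taken from the paper.
    adj, labels = planted_partition(4, 30, p_in=0.6, p_out=0.05, seed=1)
    for name, dist in [("Jaccard", jaccard_distance),
                       ("Otsuka-Ochiai", otsuka_ochiai_distance)]:
        for c, (density, mean_dist) in sorted(cluster_summary(adj, labels, dist).items()):
            print(f"{name:14s} cluster {c}: density={density:.3f}, mean distance={mean_dist:.3f}")
```

Repeating the run over a range of `p_in` values and correlating the two columns would mirror the comparison described above. With both distances, a fully connected cluster has mean distance 0, since all of its closed neighbourhoods coincide.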
