Simple Distributed Graph Clustering using Modularity and Map Equation

10/26/2017
by   Michael Hamann, et al.
0

We study large-scale, distributed graph clustering. Given an undirected, weighted graph, our objective is to partition the nodes into disjoint sets called clusters. Each cluster should contain many internal edges. Further, there should only be few edges between clusters. We study two established formalizations of this internally-dense-externally-sparse principle: modularity and map equation. We present two versions of a simple distributed algorithm to optimize both measures. They are based on Thrill, a distributed big data processing framework that implements an extended MapReduce model. The algorithms for the two measures, DSLM-Mod and DSLM-Map, differ only slightly. Adapting them for similar quality measures is easy. In an extensive experimental study, we demonstrate the excellent performance of our algorithms on real-world and synthetic graph clustering benchmark graphs.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/29/2022

A Distributed Multilevel Memetic Algorithm for Signed Graph Clustering

In real-world applications, interactions between two entities can be usu...
research
11/03/2017

Distributed Graph Clustering and Sparsification

Graph clustering is a fundamental computational problem with a number of...
research
09/28/2016

StruClus: Structural Clustering of Large-Scale Graph Databases

We present a structural clustering algorithm for large-scale datasets of...
research
08/12/2023

Latent Random Steps as Relaxations of Max-Cut, Min-Cut, and More

Algorithms for node clustering typically focus on finding homophilous st...
research
12/16/2020

Interpretable Clustering on Dynamic Graphs with Recurrent Graph Neural Networks

We study the problem of clustering nodes in a dynamic graph, where the c...
research
10/21/2022

A Simple Deterministic Distributed Low-Diameter Clustering

We give a simple, local process for nodes in an undirected graph to form...
research
10/15/2015

Sparsity-aware Possibilistic Clustering Algorithms

In this paper two novel possibilistic clustering algorithms are presente...

Please sign up or login with your details

Forgot password? Click here to reset