Dynamic Structural Clustering on Graphs

08/26/2021
by   Boyu Ruan, et al.
0

Structural Clustering (DynClu) is one of the most popular graph clustering paradigms. In this paper, we consider StrClu under two commonly adapted similarities, namely Jaccard similarity and cosine similarity on a dynamic graph, G = ⟨ V, E⟩, subject to edge insertions and deletions (updates). The goal is to maintain certain information under updates, so that the StrClu clustering result on G can be retrieved in O(|V| + |E|) time, upon request. The state-of-the-art worst-case cost is O(|V|) per update; we improve this update-time bound significantly with the ρ-approximate notion. Specifically, for a specified failure probability, δ^*, and every sequence of M updates (no need to know M's value in advance), our algorithm, DynELM, achieves O(log^2 |V| + log |V| ·logM/δ^*) amortized cost for each update, at all times in linear space. Moreover, DynELM provides a provable "sandwich" guarantee on the clustering quality at all times after each update with probability at least 1 - δ^*. We further develop DynELM into our ultimate algorithm, DynStrClu, which also supports cluster-group-by queries. Given Q⊆ V, this puts the non-empty intersection of Q and each StrClu cluster into a distinct group. DynStrClu not only achieves all the guarantees of DynELM, but also runs cluster-group-by queries in O(|Q|·log |V|) time. We demonstrate the performance of our algorithms via extensive experiments, on 15 real datasets. Experimental results confirm that our algorithms are up to three orders of magnitude more efficient than state-of-the-art competitors, and still provide quality structural clustering results. Furthermore, we study the difference between the two similarities w.r.t. the quality of approximate clustering results.

READ FULL TEXT

page 25

page 26

page 27

research
03/11/2022

Fully-dynamic α+ 2 Arboricity Decomposition and Implicit Colouring

In the implicit dynamic colouring problem, the task is to maintain a rep...
research
07/19/2023

On Dynamic Graph Algorithms with Predictions

We study dynamic algorithms in the model of algorithms with predictions....
research
03/15/2018

Dynamic Approximate Matchings with an Optimal Recourse Bound

In the area of dynamic matching algorithms, the number of changes to the...
research
05/10/2022

Parallel Batch-Dynamic Minimum Spanning Forest and the Efficiency of Dynamic Agglomerative Graph Clustering

Hierarchical agglomerative clustering (HAC) is a popular algorithm for c...
research
06/23/2020

Learning Based Distributed Tracking

Inspired by the great success of machine learning in the past decade, pe...
research
11/13/2022

Improved Dynamic Colouring of Sparse Graphs

Given a dynamic graph subject to edge insertions and deletions, we show ...
research
09/28/2016

StruClus: Structural Clustering of Large-Scale Graph Databases

We present a structural clustering algorithm for large-scale datasets of...

Please sign up or login with your details

Forgot password? Click here to reset