SSumM: Sparse Summarization of Massive Graphs

06/01/2020
by Kyuhan Lee, et al.

Given a graph G and the desired size k in bits, how can we summarize G within k bits, while minimizing the information loss? Large-scale graphs have become omnipresent, posing considerable computational challenges. Analyzing such large graphs can be fast and easy if they are compressed sufficiently to fit in main memory or even cache. Graph summarization, which yields a coarse-grained summary graph with merged nodes, stands out with several advantages among graph compression techniques. Thus, a number of algorithms have been developed for obtaining a concise summary graph with little information loss or equivalently small reconstruction error. However, the existing methods focus solely on reducing the number of nodes, and they often yield dense summary graphs, failing to achieve better compression rates. Moreover, due to their limited scalability, they can be applied only to moderate-size graphs. In this work, we propose SSumM, a scalable and effective graph-summarization algorithm that yields a sparse summary graph. SSumM not only merges nodes together but also sparsifies the summary graph, and the two strategies are carefully balanced based on the minimum description length principle. Compared with state-of-the-art competitors, SSumM is (a) Concise: yields up to 11.2X smaller summary graphs with similar reconstruction error, (b) Accurate: achieves up to 4.2X smaller reconstruction error with similarly concise outputs, and (c) Scalable: summarizes 26X larger graphs while exhibiting linear scalability. We validate these advantages through extensive experiments on 10 real-world graphs.
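To make the two ideas in the abstract concrete (merging nodes into supernodes, and sparsifying the resulting summary graph), here is a minimal sketch. It is not the SSumM algorithm: the abstract does not give the algorithm or the exact error measure, so this example assumes an undirected graph, a hand-picked node grouping, superedge weights equal to the number of original edges they cover, and an L1 reconstruction error in which each superedge spreads its weight uniformly over the node pairs it represents. The names `summarize` and `reconstruction_error` are hypothetical.

```python
from itertools import combinations


def summarize(edges, groups):
    """Build a weighted summary graph.

    Each superedge weight is the number of original edges between
    (or within) the corresponding supernodes. Assumed convention.
    """
    node_to_super = {v: s for s, members in groups.items() for v in members}
    weights = {}
    for u, v in edges:
        key = tuple(sorted((node_to_super[u], node_to_super[v])))
        weights[key] = weights.get(key, 0) + 1
    return weights


def reconstruction_error(edges, groups, weights):
    """L1 distance between the original adjacency and the adjacency
    reconstructed from the summary (uniform density per superedge)."""
    node_to_super = {v: s for s, members in groups.items() for v in members}
    edge_set = {frozenset(e) for e in edges}
    error = 0.0
    for u, v in combinations(sorted(node_to_super), 2):
        a, b = sorted((node_to_super[u], node_to_super[v]))
        if a == b:
            n = len(groups[a])
            pairs = n * (n - 1) // 2  # node pairs inside one supernode
        else:
            pairs = len(groups[a]) * len(groups[b])
        density = weights.get((a, b), 0) / pairs
        actual = 1.0 if frozenset((u, v)) in edge_set else 0.0
        error += abs(actual - density)
    return error


# Toy graph: a triangle {1, 2, 3} attached to the edge {4, 5} via (3, 4).
edges = [(1, 2), (1, 3), (2, 3), (3, 4), (4, 5)]
groups = {"A": [1, 2, 3], "B": [4, 5]}  # hand-picked merge, for illustration

weights = summarize(edges, groups)
err_full = reconstruction_error(edges, groups, weights)

# Sparsify: drop the weak superedge between A and B.
sparse = {k: w for k, w in weights.items() if k != ("A", "B")}
err_sparse = reconstruction_error(edges, groups, sparse)

print(len(weights), err_full)
print(len(sparse), err_sparse)
```

In this toy case, dropping the single weak superedge between the two supernodes makes the summary smaller and also lowers the L1 error under the assumed reconstruction rule, which illustrates why a method might weigh sparsification against merging rather than relying on merging alone.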


research
06/17/2020

Incremental Lossless Graph Summarization

Given a fully dynamic graph, represented as a stream of edge insertions ...
research
03/28/2022

Personalized Graph Summarization: Formulation, Scalable Algorithms, and Applications

Are users of an online social network interested equally in all connecti...
research
06/15/2022

Summarizing Labeled Multi-Graphs

Real-world graphs can be difficult to interpret and visualize beyond a c...
research
12/10/2021

SLUGGER: Lossless Hierarchical Summarization of Massive Graphs

Given a massive graph, how can we exploit its hierarchical structure for...
research
03/29/2022

Are Edge Weights in Summary Graphs Useful? – A Comparative Study

Which one is better between two representative graph summarization model...
research
06/16/2020

Utility-Based Graph Summarization: New and Improved

A fundamental challenge in graph mining is the ever-increasing size of d...
research
07/04/2022

Learning node embeddings via summary graphs: a brief theoretical analysis

Graph representation learning plays an important role in many graph mini...
