Graph Summarization via Node Grouping: A Spectral Algorithm

11/08/2022
by   Arpit Merchant, et al.
0

Graph summarization via node grouping is a popular method to build concise graph representations by grouping nodes from the original graph into supernodes and encoding edges into superedges such that the loss of adjacency information is minimized. Such summaries have immense applications in large-scale graph analytics due to their small size and high query processing efficiency. In this paper, we reformulate the loss minimization problem for summarization into an equivalent integer maximization problem. By initially allowing relaxed (fractional) solutions for integer maximization, we analytically expose the underlying connections to the spectral properties of the adjacency matrix. Consequently, we design an algorithm called SpecSumm that consists of two phases. In the first phase, motivated by spectral graph theory, we apply k-means clustering on the k largest (in magnitude) eigenvectors of the adjacency matrix to assign nodes to supernodes. In the second phase, we propose a greedy heuristic that updates the initial assignment to further improve summary quality. Finally, via extensive experiments on 11 datasets, we show that SpecSumm efficiently produces high-quality summaries compared to state-of-the-art summarization algorithms and scales to graphs with millions of nodes.

READ FULL TEXT
research
12/24/2021

Multi-relation Graph Summarization

Graph summarization is beneficial in a wide range of applications, such ...
research
05/27/2022

Efficient Semantic Summary Graphs for Querying Large Knowledge Graphs

Knowledge Graphs (KGs) integrate heterogeneous data, but one challenge i...
research
05/28/2021

SLGCN: Structure Learning Graph Convolutional Networks for Graphs under Heterophily

The performances of GNNs for representation learning on the graph-struct...
research
04/04/2013

Spectral Descriptors for Graph Matching

In this paper, we consider the weighted graph matching problem. Recently...
research
06/16/2020

Utility-Based Graph Summarization: New and Improved

A fundamental challenge in graph mining is the ever-increasing size of d...
research
06/08/2019

Sentence Centrality Revisited for Unsupervised Summarization

Single document summarization has enjoyed renewed interests in recent ye...
research
02/13/2018

Network Summarization with Preserved Spectral Properties

Large-scale networks are widely used to represent object relationships i...

Please sign up or login with your details

Forgot password? Click here to reset