Mining Frequent Patterns in Evolving Graphs

09/02/2018
by   Cigdem Aslay, et al.
0

Given a labeled graph, the frequent-subgraph mining (FSM) problem asks to find all the k-vertex subgraphs that appear with frequency greater than a given threshold. FSM has numerous applications ranging from biology to network science, as it provides a compact summary of the characteristics of the graph. However, the task is challenging, even more so for evolving graphs due to the streaming nature of the input and the exponential time complexity of the problem. In this paper, we initiate the study of the approximate FSM problem in both incremental and fully-dynamic streaming settings, where arbitrary edges can be added or removed from the graph. For each streaming setting, we propose algorithms that can extract a high-quality approximation of the frequent k-vertex subgraphs for a given threshold, at any given time instance, with high probability. In contrast to the existing state-of-the-art solutions that require iterating over the entire set of subgraphs for any update, our algorithms operate by maintaining a uniform sample of k-vertex subgraphs with optimized neighborhood-exploration procedures local to the updates. We provide theoretical analysis of the proposed algorithms and empirically demonstrate that the proposed algorithms generate high-quality results compared to baselines.

READ FULL TEXT
research
11/01/2019

A limit theorem for the 1st Betti number of layer-1 subgraphs in random graphs

We initiate the study of local topology of random graphs. The high level...
research
03/10/2018

Efficient Enumeration of Bipartite Subgraphs in Graphs

Subgraph enumeration problems ask to output all subgraphs of an input gr...
research
11/24/2021

Scale-Invariant Strength Assortativity of Streaming Butterflies

Bipartite graphs are rich data structures with prevalent applications an...
research
07/15/2012

Classification of Approaches and Challenges of Frequent Subgraphs Mining in Biological Networks

Understanding the structure and dynamics of biological networks is one o...
research
08/18/2020

Mining Large Quasi-cliques with Quality Guarantees from Vertex Neighborhoods

Mining dense subgraphs is an important primitive across a spectrum of gr...
research
11/21/2022

Expander Decomposition in Dynamic Streams

In this paper we initiate the study of expander decompositions of a grap...
research
09/02/2021

Computing Graph Descriptors on Edge Streams

Graph feature extraction is a fundamental task in graphs analytics. Usin...

Please sign up or login with your details

Forgot password? Click here to reset