Near-optimal Top-k Pattern Mining

02/16/2022
by   Xin Wang, et al.
0

Nowadays, frequent pattern mining (FPM) on large graphs receives increasing attention, since it is crucial to a variety of applications, e.g., social analysis. Informally, the FPM problem is defined as finding all the patterns in a large graph with frequency above a user-defined threshold. However, this problem is nontrivial due to the unaffordable computational and space costs in the mining process. In light of this, we propose a cost-effective approach to mining near-optimal top-k patterns. Our approach applies a "level-wise" strategy to incrementally detect frequent patterns, hence is able to terminate as soon as top-k patterns are discovered. Moreover, we develop a technique to compute the lower bound of support with smart traverse strategy and compact data structures. Extensive experimental studies on real-life and synthetic graphs show that our approach performs well, i.e., it outperforms traditional counterparts in efficiency, memory footprint, recall and scalability.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/20/2022

FreSCo: Mining Frequent Patterns in Simplicial Complexes

Simplicial complexes are a generalization of graphs that model higher-or...
research
02/22/2023

Mining compact high utility sequential patterns

High utility sequential pattern mining (HUSPM) aims to mine all patterns...
research
02/15/2015

Fast and Memory-Efficient Significant Pattern Mining via Permutation Testing

We present a novel algorithm, Westfall-Young light, for detecting patter...
research
04/21/2009

Fast Algorithms for Mining Interesting Frequent Itemsets without Minimum Support

Real world datasets are sparse, dirty and contain hundreds of items. In ...
research
03/21/2021

BigCarl: Mining frequent subnets from a single large Petri net

While there have been lots of work studying frequent subgraph mining, ve...
research
05/22/2020

Discovering Frequent Gradual Itemsets with Imprecise Data

The gradual patterns that model the complex co-variations of attributes ...
research
12/19/2019

FIBS: A Generic Framework for Classifying Interval-based Temporal Sequences

We study the problem of classification of interval-based temporal sequen...

Please sign up or login with your details

Forgot password? Click here to reset