FreSCo: Mining Frequent Patterns in Simplicial Complexes

01/20/2022
by   Giulia Preti, et al.
0

Simplicial complexes are a generalization of graphs that model higher-order relations. In this paper, we introduce simplicial patterns – that we call simplets – and generalize the task of frequent pattern mining from the realm of graphs to that of simplicial complexes. Our task is particularly challenging due to the enormous search space and the need for higher-order isomorphism. We show that finding the occurrences of simplets in a complex can be reduced to a bipartite graph isomorphism problem, in linear time and at most quadratic space. We then propose an anti-monotonic frequency measure that allows us to start the exploration from small simplets and stop expanding a simplet as soon as its frequency falls below the minimum frequency threshold. Equipped with these ideas and a clever data structure, we develop a memory-conscious algorithm that, by carefully exploiting the relationships among the simplices in the complex and among the simplets, achieves efficiency and scalability for our complex mining task. Our algorithm, FreSCo, comes in two flavors: it can compute the exact frequency of the simplets or, more quickly, it can determine whether a simplet is frequent, without having to compute the exact frequency. Experimental results prove the ability of FreSCo to mine frequent simplets in complexes of various size and dimension, and the significance of the simplets with respect to the traditional graph patterns.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/02/2015

Fast Generation of Best Interval Patterns for Nonmonotonic Constraints

In pattern mining, the main challenge is the exponential explosion of th...
research
02/16/2022

Near-optimal Top-k Pattern Mining

Nowadays, frequent pattern mining (FPM) on large graphs receives increas...
research
04/16/2019

Most Frequent Itemset Optimization

In this paper we are dealing with the frequent itemset mining. We concen...
research
02/07/2019

Significance of Episodes Based on Minimal Windows

Discovering episodes, frequent sets of events from a sequence has been a...
research
03/28/2017

Mining Best Closed Itemsets for Projection-antimonotonic Constraints in Polynomial Time

The exponential explosion of the set of patterns is one of the main chal...
research
06/16/2020

MCRapper: Monte-Carlo Rademacher Averages for Poset Families and Approximate Pattern Mining

We present MCRapper, an algorithm for efficient computation of Monte-Car...
research
01/07/2013

Finding the True Frequent Itemsets

Frequent Itemsets (FIs) mining is a fundamental primitive in data mining...

Please sign up or login with your details

Forgot password? Click here to reset