Mining Best Closed Itemsets for Projection-antimonotonic Constraints in Polynomial Time

03/28/2017
by   Aleksey Buzmakov, et al.
0

The exponential explosion of the set of patterns is one of the main challenges in pattern mining. This challenge is approached by introducing a constraint for pattern selection. One of the first constraints proposed in pattern mining is support (frequency) of a pattern in a dataset. Frequency is an anti-monotonic function, i.e., given an infrequent pattern, all its superpatterns are not frequent. However, many other constraints for pattern selection are neither monotonic nor anti-monotonic, which makes it difficult to generate patterns satisfying these constraints. In order to deal with nonmonotonic constraints we introduce the notion of "projection antimonotonicity" and SOFIA algorithm that allow generating best patterns for a class of nonmonotonic constraints. Cosine interest, robustness, stability of closed itemsets, and the associated delta-measure are among these constraints. SOFIA starts from light descriptions of transactions in dataset (a small set of items in the case of itemset description) and then iteratively adds more information to these descriptions (more items with indication of tidsets they describe).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/02/2015

Fast Generation of Best Interval Patterns for Nonmonotonic Constraints

In pattern mining, the main challenge is the exponential explosion of th...
research
02/18/2019

Finding Robust Itemsets Under Subsampling

Mining frequent patterns is plagued by the problem of pattern explosion ...
research
11/14/2018

Constraint-based Sequential Pattern Mining with Decision Diagrams

Constrained sequential pattern mining aims at identifying frequent patte...
research
01/20/2022

FreSCo: Mining Frequent Patterns in Simplicial Complexes

Simplicial complexes are a generalization of graphs that model higher-or...
research
06/10/2023

TALENT: Targeted Mining of Non-overlapping Sequential Patterns

With the widespread application of efficient pattern mining algorithms, ...
research
10/28/2016

Flexible constrained sampling with guarantees for pattern mining

Pattern sampling has been proposed as a potential solution to the infamo...
research
10/27/2015

Redesigning pattern mining algorithms for supercomputers

Upcoming many core processors are expected to employ a distributed memor...

Please sign up or login with your details

Forgot password? Click here to reset