Mining Non-Redundant Sets of Generalizing Patterns from Sequence Databases

12/12/2017
by   Niek Tax, et al.
0

Sequential pattern mining techniques extract patterns corresponding to frequent subsequences from a sequence database. A practical limitation of these techniques is that they overload the user with too many patterns. Local Process Model (LPM) mining is an alternative approach coming from the field of process mining. While in traditional sequential pattern mining, a pattern describes one subsequence, an LPM captures a set of subsequences. Also, while traditional sequential patterns only match subsequences that are observed in the sequence database, an LPM may capture subsequences that are not explicitly observed, but that are related to observed subsequences. In other words, LPMs generalize the behavior observed in the sequence database. These properties make it possible for a set of LPMs to cover the behavior of a much larger set of sequential patterns. Yet, existing LPM mining techniques still suffer from the pattern explosion problem because they produce sets of redundant LPMs. In this paper, we propose several heuristics to mine a set of non-redundant LPMs either from a set of redundant LPMs or from a set of sequential patterns. We empirically compare the proposed heuristics between them and against existing (local) process mining techniques in terms of coverage, precision, and complexity of the produced sets of LPMs.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/04/2009

Mining Compressed Repetitive Gapped Sequential Patterns Efficiently

Mining frequent sequential patterns from sequence databases has been a c...
research
07/26/2017

Declarative Sequential Pattern Mining of Care Pathways

Sequential pattern mining algorithms are widely used to explore care pat...
research
02/16/2016

A Subsequence Interleaving Model for Sequential Pattern Mining

Recent sequential pattern mining methods have used the minimum descripti...
research
06/28/2019

RECURSIA-RRT: Recursive translatable point-set pattern discovery with removal of redundant translators

Two algorithms, RECURSIA and RRT, are presented, designed to increase th...
research
11/14/2018

Constraint-based Sequential Pattern Mining with Decision Diagrams

Constrained sequential pattern mining aims at identifying frequent patte...
research
06/21/2020

Database Optimization to Recommend Software Developers using Canonical Order Tree

Recently frequent and sequential pattern mining algorithms have been wid...
research
02/07/2019

The Long and the Short of It: Summarising Event Sequences with Serial Episodes

An ideal outcome of pattern mining is a small set of informative pattern...

Please sign up or login with your details

Forgot password? Click here to reset