Mining Closed Episodes with Simultaneous Events

04/16/2019
by   Nikolaj Tatti, et al.
0

Sequential pattern discovery is a well-studied field in data mining. Episodes are sequential patterns describing events that often occur in the vicinity of each other. Episodes can impose restrictions to the order of the events, which makes them a versatile technique for describing complex patterns in the sequence. Most of the research on episodes deals with special cases such as serial, parallel, and injective episodes, while discovering general episodes is understudied. In this paper we extend the definition of an episode in order to be able to represent cases where events often occur simultaneously. We present an efficient and novel miner for discovering frequent and closed general episodes. Such a task presents unique challenges. Firstly, we cannot define closure based on frequency. We solve this by computing a more conservative closure that we use to reduce the search space and discover the closed episodes as a postprocessing step. Secondly, episodes are traditionally presented as directed acyclic graphs. We argue that this representation has drawbacks leading to redundancy in the output. We solve these drawbacks by defining a subset relationship in such a way that allows us to remove the redundant episodes. We demonstrate the efficiency of our algorithm and the need for using closed episodes empirically on synthetic and real-world datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/14/2019

Mining Closed Strict Episodes

Discovering patterns in a sequence is an important aspect of data mining...
research
02/04/2019

Ranking Episodes using a Partition Model

One of the biggest setbacks in traditional frequent pattern mining is th...
research
04/15/2019

Discovering Episodes with Compact Minimal Windows

Discovering the most interesting patterns is the key problem in the fiel...
research
05/19/2018

Free-rider Episode Screening via Dual Partition Model

One of the drawbacks of frequent episode mining is that overwhelmingly m...
research
02/05/2019

Discovering bursts revisited: guaranteed optimization of the model parameters

One of the classic data mining tasks is to discover bursts, time interva...
research
04/04/2022

Explicit and Implicit Pattern Relation Analysis for Discovering Actionable Negative Sequences

Real-life events, behaviors and interactions produce sequential data. An...
research
10/06/2020

Discovery data topology with the closure structure. Theoretical and practical aspects

In this paper, we are revisiting pattern mining and especially itemset m...

Please sign up or login with your details

Forgot password? Click here to reset