TKUS: Mining Top-K High-Utility Sequential Patterns

11/26/2020
by   Chunkai Zhang, et al.

High-utility sequential pattern mining (HUSPM) has recently emerged as a focus of intense research interest. The main task of HUSPM is to find all subsequences, within a quantitative sequential database, whose utility is no less than a user-defined minimum utility threshold. However, this threshold is difficult to specify, especially when the features of the database, which are invisible in most cases, are not understood. To handle this problem, top-k HUSPM was proposed: instead of a threshold, the user specifies the number k of patterns to return. Up to now, only very preliminary work has been conducted to capture top-k HUSPs, and existing strategies require improvement in terms of running time, memory consumption, unpromising candidate filtering, and scalability. Moreover, no systematic problem statement has been defined. In this paper, we formulate the problem of top-k HUSPM and propose a novel algorithm called TKUS. To improve efficiency, TKUS adopts a projection and local search mechanism and employs several schemes, including the Sequence Utility Raising, Terminate Descendants Early, and Eliminate Unpromising Items strategies, which allow it to greatly reduce the search space. Finally, experimental results demonstrate that TKUS achieves good top-k HUSPM performance compared to the state-of-the-art algorithm TKHUS-Span.
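
To make the top-k formulation concrete, the following is a minimal brute-force sketch in Python. It is not the TKUS algorithm and uses none of its pruning strategies. It assumes the common HUSPM convention that the utility of a pattern in one sequence is the maximum over its occurrences, and that its utility in the database is the sum over the sequences containing it. The toy database, the single-item elements, and all function names are hypothetical and chosen only for illustration.

```python
import heapq
from itertools import combinations

# Hypothetical toy database: each q-sequence is a list of (item, utility)
# pairs. Every element holds a single item to keep the sketch short.
DB = [
    [("a", 2), ("b", 6), ("a", 5)],
    [("b", 3), ("a", 4), ("c", 1)],
    [("a", 1), ("c", 8)],
]

def utility_in_sequence(pattern, qseq):
    """Maximum utility over all occurrences of pattern in qseq, or None."""
    neg = float("-inf")
    # best[j] is the best utility found so far for matching pattern[:j].
    best = [0] + [neg] * len(pattern)
    for item, util in qseq:
        # Go right to left so one position is never matched twice.
        for j in range(len(pattern), 0, -1):
            if pattern[j - 1] == item and best[j - 1] != neg:
                best[j] = max(best[j], best[j - 1] + util)
    return None if best[-1] == neg else best[-1]

def utility(pattern, db):
    """Sum of per-sequence utilities over the sequences containing pattern."""
    total = 0
    for qseq in db:
        u = utility_in_sequence(pattern, qseq)
        if u is not None:
            total += u
    return total

def candidates(db):
    """All distinct subsequences occurring in db (exponential; toy sizes only)."""
    seen = set()
    for qseq in db:
        items = [item for item, _ in qseq]
        for n in range(1, len(items) + 1):
            for idx in combinations(range(len(items)), n):
                seen.add(tuple(items[i] for i in idx))
    return seen

def top_k(db, k):
    """Return the k highest-utility patterns and the final internal threshold."""
    heap = []      # min-heap of (utility, pattern), at most k entries
    threshold = 0  # rises to the k-th best utility once k patterns are held
    for pattern in sorted(candidates(db)):
        u = utility(pattern, db)
        if len(heap) < k:
            heapq.heappush(heap, (u, pattern))
        elif u > heap[0][0]:
            heapq.heapreplace(heap, (u, pattern))
        if len(heap) == k:
            threshold = heap[0][0]
    return sorted(heap, reverse=True), threshold

patterns, threshold = top_k(DB, 3)
print(patterns, threshold)
```

On this toy data the pattern (b, a) has utility 18, which is higher than that of its prefix (b) at 9 and of its extension (b, a, c) at 8. Utility is therefore neither monotone nor anti-monotone, which is why a practical miner needs upper bounds and pruning strategies rather than exhaustive enumeration.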


