Fast Utility Mining on Complex Sequences

04/28/2019
by   Wensheng Gan, et al.
0

High-utility sequential pattern mining is an emerging topic in the field of Knowledge Discovery in Databases. It consists of discovering subsequences having a high utility (importance) in sequences, referred to as high-utility sequential patterns (HUSPs). HUSPs can be applied to many real-life applications, such as market basket analysis, E-commerce recommendation, click-stream analysis and scenic route planning. For example, in economics and targeted marketing, understanding economic behavior of consumers is quite challenging, such as finding credible and reliable information on product profitability. Several algorithms have been proposed to address this problem by efficiently mining utility-based useful sequential patterns. Nevertheless, the performance of these algorithms can be unsatisfying in terms of runtime and memory usage due to the combinatorial explosion of the search space for low utility threshold and large databases. Hence, this paper proposes a more efficient algorithm for the task of high-utility sequential pattern mining, called HUSP-ULL. It utilizes a lexicographic sequence (LS)-tree and a utility-linked (UL)-list structure to fast discover HUSPs. Furthermore, two pruning strategies are introduced in HUSP-ULL to obtain tight upper-bounds on the utility of candidate sequences, and reduce the search space by pruning unpromising candidates early. Substantial experiments both on real-life and synthetic datasets show that the proposed algorithm can effectively and efficiently discover the complete set of HUSPs and outperforms the state-of-the-art algorithms.

READ FULL TEXT

page 12

page 13

research
12/29/2022

HUSP-SP: Faster Utility Mining on Sequence Data

High-utility sequential pattern mining (HUSPM) has emerged as an importa...
research
02/22/2023

Mining compact high utility sequential patterns

High utility sequential pattern mining (HUSPM) aims to mine all patterns...
research
12/25/2019

Utility Mining Across Multi-Sequences with Individualized Thresholds

Utility-oriented pattern mining has become an emerging topic since it ca...
research
02/25/2019

Utility Mining Across Multi-Dimensional Sequences

Knowledge extraction from database is the fundamental task in database a...
research
10/30/2021

Utility-driven Mining of Contiguous Sequences

Recently, contiguous sequential pattern mining (CSPM) gained interest as...
research
02/25/2019

Beyond Frequency: Utility Mining with Varied Item-Specific Minimum Utility

Utility-oriented mining which integrates utility theory and data mining ...
research
03/11/2021

FRI-Miner: Fuzzy Rare Itemset Mining

Data mining is a widely used technology for various real-life applicatio...

Please sign up or login with your details

Forgot password? Click here to reset