Targeted Mining of Top-k High Utility Itemsets

03/25/2023
by   Shan Huang, et al.
0

Finding high-importance patterns in data is an emerging data mining task known as High-utility itemset mining (HUIM). Given a minimum utility threshold, a HUIM algorithm extracts all the high-utility itemsets (HUIs) whose utility values are not less than the threshold. This can reveal a wealth of useful information, but the precise needs of users are not well taken into account. In particular, users often want to focus on patterns that have some specific items rather than find all patterns. To overcome that difficulty, targeted mining has emerged, focusing on user preferences, but only preliminary work has been conducted. For example, the targeted high-utility itemset querying algorithm (TargetUM) was proposed, which uses a lexicographic tree to query itemsets containing a target pattern. However, selecting the minimum utility threshold is difficult when the user is not familiar with the processed database. As a solution, this paper formulates the task of targeted mining of the top-k high-utility itemsets and proposes an efficient algorithm called TMKU based on the TargetUM algorithm to discover the top-k target high-utility itemsets (top-k THUIs). At the same time, several pruning strategies are used to reduce memory consumption and execution time. Extensive experiments show that the proposed TMKU algorithm has good performance on real and synthetic datasets.

READ FULL TEXT
research
10/30/2021

TargetUM: Targeted High-Utility Itemset Querying

Traditional high-utility itemset mining (HUIM) aims to determine all hig...
research
06/09/2022

Towards Target High-Utility Itemsets

For applied intelligence, utility-driven pattern discovery algorithms ca...
research
11/26/2020

TKUS: Mining Top-K High-Utility Sequential Patterns

High-utility sequential pattern mining (HUSPM) has recently emerged as a...
research
09/04/2018

A comparative study of top-k high utility itemset mining methods

High Utility Itemset (HUI) mining problem is one of the important proble...
research
03/30/2021

TUSQ: Targeted High-Utility Sequence Querying

Significant efforts have been expended in the research and development o...
research
07/06/2023

Finding Favourite Tuples on Data Streams with Provably Few Comparisons

One of the most fundamental tasks in data science is to assist a user wi...
research
12/27/2021

An efficient mining scheme for high utility itemsets

Knowledge discovery in databases aims at finding useful information, whi...

Please sign up or login with your details

Forgot password? Click here to reset