TOPIC: Top-k High-Utility Itemset Discovering

06/28/2021
by   Jiahui Chen, et al.

Utility-driven itemset mining is widely applied in many real-world scenarios. However, most algorithms do not work for itemsets with negative utilities. Several efficient algorithms for high-utility itemset (HUI) mining with negative utilities have been proposed, and these algorithms can find the complete set of HUIs with or without negative utilities. Their major problem is how to select an appropriate minimum utility (minUtil) threshold. To address this issue, some efficient algorithms for extracting top-k HUIs have been proposed, where the parameter k is the number of HUIs to be discovered. However, each of these algorithms solves only one part of the above problem: either handling negative utilities or removing the need to choose minUtil. In this paper, we present a method for TOP-k high-utility Itemset disCovering (TOPIC) with positive and negative utility values, which combines the advantages of the above algorithms. TOPIC adopts transaction merging and database projection techniques to reduce the database scanning cost, and utilizes minUtil threshold raising strategies. It also uses an array-based utility technique, which calculates the utility of itemsets and upper bounds in linear time. We conducted extensive experiments on several real and synthetic datasets, and the results show that TOPIC outperforms state-of-the-art algorithms in terms of runtime, memory cost, and scalability.
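
To make the top-k setting concrete, the following is a minimal sketch of threshold raising on a toy database with positive and negative utilities. It is not the TOPIC algorithm: it enumerates every itemset by brute force and uses none of the transaction merging, database projection, or array-based techniques mentioned above. The database `TRANSACTIONS`, the functions `utility` and `top_k_itemsets`, and the choice to start the internal threshold at 0 are all assumptions made for illustration.

```python
import heapq
from itertools import combinations

# Hypothetical toy database: each transaction maps an item to its utility
# in that transaction. Item "b" carries negative utilities.
TRANSACTIONS = [
    {"a": 5, "b": -2, "c": 3},
    {"a": 4, "c": 1},
    {"b": -1, "c": 6, "d": 2},
    {"a": 2, "b": -3, "d": 7},
]


def utility(itemset, transactions):
    """Sum the itemset's utility over every transaction containing all its items."""
    items = set(itemset)
    return sum(
        sum(t[i] for i in items)
        for t in transactions
        if items.issubset(t.keys())
    )


def top_k_itemsets(transactions, k):
    """Brute-force top-k search with a min-heap that raises the threshold."""
    all_items = sorted({i for t in transactions for i in t})
    heap = []      # min-heap of (utility, itemset); smallest kept utility on top
    min_util = 0   # assumed starting threshold; raised once k itemsets are held
    for size in range(1, len(all_items) + 1):
        for itemset in combinations(all_items, size):
            u = utility(itemset, transactions)
            if u <= min_util:
                continue  # cannot enter the current top-k
            if len(heap) < k:
                heapq.heappush(heap, (u, itemset))
            else:
                heapq.heapreplace(heap, (u, itemset))  # evict the weakest itemset
            if len(heap) == k:
                min_util = heap[0][0]  # raise threshold to the k-th best utility
    return sorted(heap, reverse=True), min_util


if __name__ == "__main__":
    best, threshold = top_k_itemsets(TRANSACTIONS, 3)
    for u, itemset in best:
        print(itemset, u)
    print("final minUtil:", threshold)
```

The user supplies only k; the threshold is derived from the k-th best utility seen so far and only ever increases, which is what lets a real miner prune more aggressively as the search proceeds.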

