A one-phase tree-based algorithm for mining high-utility itemsets from a transaction database

11/17/2019
by   Siddharth Dawar, et al.
0

High-utility itemset mining finds itemsets from a transaction database with utility no less than a fixed user-defined threshold. The utility of an itemset is defined as the sum of the utilities of its item. Several algorithms were proposed to mine high-utility itemsets. However, no state-of-the-art algorithm performs consistently good across dense and sparse datasets. In this paper, we propose a novel data structure called Utility-Tree, and a tree-based algorithm called UT-Miner that mines high-utility itemsets in one-phase only without generating any candidates and uses a lightweight construction method to reduce the cost of creating projected databases during the search space exploration. The transaction information is stored compactly with every node of the Utility-Tree, and the information is computed efficiently during the recursive invocation of the algorithm. Experimental results on several real-life dense and sparse datasets reveal that UT-Miner is among the top-performing efficient algorithms across different datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/30/2021

TargetUM: Targeted High-Utility Itemset Querying

Traditional high-utility itemset mining (HUIM) aims to determine all hig...
research
12/18/2018

High-utility itemset mining for subadditive monotone utility functions

High-utility Itemset Mining (HUIM) finds itemsets from a transaction dat...
research
08/18/2020

Discovering High Utility-Occupancy Patterns from Uncertain Data

It is widely known that there is a lot of useful information hidden in b...
research
02/25/2019

Beyond Frequency: Utility Mining with Varied Item-Specific Minimum Utility

Utility-oriented mining which integrates utility theory and data mining ...
research
04/21/2023

Mining Privacy-Preserving Association Rules based on Parallel Processing in Cloud Computing

With the onset of the Information Era and the rapid growth of informatio...
research
06/28/2021

TOPIC: Top-k High-Utility Itemset Discovering

Utility-driven itemset mining is widely applied in many real-world scena...
research
03/30/2021

TUSQ: Targeted High-Utility Sequence Querying

Significant efforts have been expended in the research and development o...

Please sign up or login with your details

Forgot password? Click here to reset