HUOPM: High Utility Occupancy Pattern Mining

12/28/2018
by   Wensheng Gan, et al.
0

Mining useful patterns from varied types of databases is an important research topic, which has many real-life applications. Most studies have considered the frequency as sole interestingness measure for identifying high quality patterns. However, each object is different in nature. The relative importance of objects is not equal, in terms of criteria such as the utility, risk, or interest. Besides, another limitation of frequent patterns is that they generally have a low occupancy, i.e., they often represent small sets of items in transactions containing many items, and thus may not be truly representative of these transactions. To extract high quality patterns in real life applications, this paper extends the occupancy measure to also assess the utility of patterns in transaction databases. We propose an efficient algorithm named High Utility Occupancy Pattern Mining (HUOPM). It considers user preferences in terms of frequency, utility, and occupancy. A novel Frequency-Utility tree (FU-tree) and two compact data structures, called the utility-occupancy list and FU-table, are designed to provide global and partial downward closure properties for pruning the search space. The proposed method can efficiently discover the complete set of high quality patterns without candidate generation. Extensive experiments have been conducted on several datasets to evaluate the effectiveness and efficiency of the proposed algorithm. Results show that the derived patterns are intelligible, reasonable and acceptable, and that HUOPM with its pruning strategies outperforms the state-of-the-art algorithm, in terms of runtime and search space, respectively.

READ FULL TEXT
research
08/18/2020

Discovering High Utility-Occupancy Patterns from Uncertain Data

It is widely known that there is a lot of useful information hidden in b...
research
12/20/2022

Towards Sequence Utility Maximization under Utility Occupancy Measure

The discovery of utility-driven patterns is a useful and difficult resea...
research
11/24/2021

Flexible Pattern Discovery and Analysis

Based on the analysis of the proportion of utility in the supporting tra...
research
02/25/2019

Utility-driven Data Analytics on Uncertain Data

Modern Internet of Things (IoT) applications generate massive amounts of...
research
06/16/2020

Decomposable Families of Itemsets

The problem of selecting a small, yet high quality subset of patterns fr...
research
02/25/2019

Beyond Frequency: Utility Mining with Varied Item-Specific Minimum Utility

Utility-oriented mining which integrates utility theory and data mining ...
research
12/26/2020

Discovering Closed and Maximal Embedded Patterns from Large Tree Data

We address the problem of summarizing embedded tree patterns extracted f...

Please sign up or login with your details

Forgot password? Click here to reset