Frequent Item-set Mining without Ubiquitous Items

03/29/2018
by   Ran M. Bittmann, et al.
0

Frequent Item-set Mining (FIM), sometimes called Market Basket Analysis (MBA) or Association Rule Learning (ARL), are Machine Learning (ML) methods for creating rules from datasets of transactions of items. Most methods identify items likely to appear together in a transaction based on the support (i.e. a minimum number of relative co-occurrence of the items) for that hypothesis. Although this is a good indicator to measure the relevance of the assumption that these items are likely to appear together, the phenomenon of very frequent items, referred to as ubiquitous items, is not addressed in most algorithms. Ubiquitous items have the same entropy as infrequent items, and not contributing significantly to the knowledge. On the other hand, they have strong effect on the performance of the algorithms and sometimes preventing the convergence of the FIM algorithms and thus the provision of meaningful results. This paper discusses the phenomenon of ubiquitous items and demonstrates how ignoring these has a dramatic effect on the computation performances but with a low and controlled effect on the significance of the results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/28/2020

Recursive Association Rule Mining

Mining frequent itemsets and association rules is an essential task with...
research
09/28/2017

Measuring the Eccentricity of Items

The long-tail phenomenon tells us that there are many items in the tail....
research
06/18/2018

Mining frequent items in unstructured P2P networks

Large scale decentralized systems, such as P2P, sensor or IoT device net...
research
01/30/2017

Comparing Dataset Characteristics that Favor the Apriori, Eclat or FP-Growth Frequent Itemset Mining Algorithms

Frequent itemset mining is a popular data mining technique. Apriori, Ecl...
research
12/26/2010

Mining Multi-Level Frequent Itemsets under Constraints

Mining association rules is a task of data mining, which extracts knowle...
research
08/07/2023

POSIT: Promotion of Semantic Item Tail via Adversarial Learning

In many recommender problems, a handful of popular items (e.g. movies/TV...
research
05/05/1999

DRAFT : Task System and Item Architecture (TSIA)

During its execution, a task is independent of all other tasks. For an a...

Please sign up or login with your details

Forgot password? Click here to reset