Evaluation of Frequent Itemset Mining Platforms using Apriori and FP-Growth Algorithm

02/28/2019
by Ravi Ranjan, et al.

With complex, heterogeneous data arriving from anywhere, at any time, and from any device, we are undeniably in an era of Big Data. The emergence of Big Data as a disruptive technology for the next generation of intelligent systems has raised many questions about how to extract and use knowledge from data in a short time, on a limited budget, and under high rates of data generation. Companies are recognizing that big data can be used to make more accurate predictions and, with an appropriate association rule mining algorithm, to improve their business. To help these organizations decide which software and algorithm best suit their dataset, we compared three of the best-known MapReduce-based platforms, Hadoop, Spark, and Flink, on two widely used algorithms, Apriori and FP-Growth, across datasets of different scales.
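
For readers new to frequent itemset mining, the level-wise idea behind Apriori can be sketched on a single machine as follows. This is only an illustrative sketch, not the distributed Hadoop, Spark, or Flink implementations the abstract refers to; the function name `apriori`, the sample `baskets`, and the use of an absolute count as the support threshold are all assumptions made for the example.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {itemset: count} for every itemset appearing in at least
    min_support transactions (min_support is an absolute count here)."""
    transactions = [frozenset(t) for t in transactions]

    # Level 1: count individual items and keep the frequent ones.
    counts = {}
    for t in transactions:
        for item in t:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    current = {s: c for s, c in counts.items() if c >= min_support}
    frequent = dict(current)

    k = 2
    while current:
        candidates = set()
        # Join step: combine frequent (k-1)-itemsets into k-item candidates.
        for a, b in combinations(list(current), 2):
            union = a | b
            if len(union) != k:
                continue
            # Prune step: every (k-1)-subset of a candidate must be frequent.
            if all(frozenset(sub) in current
                   for sub in combinations(union, k - 1)):
                candidates.add(union)
        # Count step: one pass over the data per level.
        counts = {c: sum(1 for t in transactions if c <= t)
                  for c in candidates}
        current = {s: c for s, c in counts.items() if c >= min_support}
        frequent.update(current)
        k += 1
    return frequent


# Hypothetical toy dataset, not taken from the paper.
baskets = [
    {"bread", "milk"},
    {"bread", "diapers", "beer", "eggs"},
    {"milk", "diapers", "beer", "cola"},
    {"bread", "milk", "diapers", "beer"},
    {"bread", "milk", "diapers", "cola"},
]
result = apriori(baskets, min_support=3)
```

The repeated pass over all transactions at each level is the cost that FP-Growth is designed to avoid by compressing the data into a prefix tree first, which is one reason the two algorithms are commonly benchmarked against each other.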

research
08/25/2018

Taxonomy of Big Data: A Survey

The Big Data is the most popular paradigm nowadays and it has almost no ...
research
04/15/2021

Introduction to Big data Technology

Big data is no more "all just hype" but widely applied in nearly all asp...
research
02/02/2022

Impact Analysis of Harassment Against Women Using Association Rule Mining Approaches: Bangladesh Prospective

In recent years, it has been noticed that women are making progress in e...
research
07/26/2018

EBIC: an open source software for high-dimensional and big data biclustering analyses

Motivation: In this paper we present the latest release of EBIC, a next-...
research
03/18/2018

A Guided FP-growth algorithm for fast mining of frequent itemsets from big data

In this paper we present the GFP-growth (Guided FP-growth) algorithm, a ...
research
09/10/2021

How Can Subgroup Discovery Help AIOps?

The genuine supervision of modern IT systems brings new challenges as it...
research
12/17/2017

A MapReduce-based rotation forest classifier for epileptic seizure prediction

In this era, big data applications including biomedical are becoming att...
