A Survey of Parallel Sequential Pattern Mining

05/26/2018
by Wensheng Gan, et al.

With the growing popularity of shared resources, large volumes of complex data of different types are collected automatically. Traditional data mining algorithms generally face challenges including huge memory cost, low processing speed, and inadequate hard disk space. Sequential pattern mining (SPM) is used in a wide variety of real-life applications. However, it is more complex and challenging than frequent itemset mining, and it suffers from the same challenges when handling large-scale data. To address these problems, mining sequential patterns in a parallel computing environment has emerged as an important issue with many applications. This paper provides an in-depth survey of the current status of parallel sequential pattern mining (PSPM), including a detailed categorization of traditional serial SPM approaches and state-of-the-art parallel SPM. We review the related work on PSPM in detail, covering partition-based, Apriori-based, pattern-growth-based, and hybrid algorithms, and describe the characteristics, advantages, and disadvantages of each parallel approach. Some advanced topics for PSPM and the related open-source software are also reviewed in detail. Finally, we summarize some challenges and opportunities of PSPM in the big data era.

research
09/27/2022

Contrast Pattern Mining: A Survey

Contrast pattern mining (CPM) is an important and popular subfield of da...
research
05/26/2018

A Survey of Utility-Oriented Pattern Mining

The main purpose of data mining and analytics is to find novel, potentia...
research
08/11/2021

Parallel algorithms for mining of frequent itemsets

In the recent decade companies started collecting of large amount of dat...
research
02/06/2022

Memory Efficient Tries for Sequential Pattern Mining

The rapid and continuous growth of data has increased the need for scala...
research
01/19/2022

All one needs to know about shared micromobility simulation: a complete survey

As the shared micromobility becomes a part of our daily life and environ...
research
08/09/2020

Big Networks: A Survey

A network is a typical expressive form of representing complex systems i...