PM4Py-GPU: a High-Performance General-Purpose Library for Process Mining

04/11/2022
by   Alessandro Berti, et al.
0

Open-source process mining provides many algorithms for the analysis of event data which could be used to analyze mainstream processes (e.g., O2C, P2P, CRM). However, compared to commercial tools, they lack the performance and struggle to analyze large amounts of data. This paper presents PM4Py-GPU, a Python process mining library based on the NVIDIA RAPIDS framework. Thanks to the dataframe columnar storage and the high level of parallelism, a significant speed-up is achieved on classic process mining computations and processing activities.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/30/2019

Increasing Scalability of Process Mining using Event Dataframes: How Data Structure Matters

Process Mining is a branch of Data Science that aims to extract process-...
research
05/15/2019

Process Mining for Python (PM4Py): Bridging the Gap Between Process- and Data Science

Process mining, i.e., a sub-field of data science focusing on the analys...
research
11/09/2019

DataSist: A Python-based library for easy data analysis, visualization and modeling

A large amount of data is produced every second from modern information ...
research
05/29/2021

Dash Sylvereye: A WebGL-powered Library for Dashboard-driven Visualization of Large Street Networks

State-of-the-art open network visualization tools like Gephi, KeyLines, ...
research
10/22/2021

Signal-Envelope: A C++ library with Python bindings for temporal envelope estimation

Signals can be interpreted as composed of a rapidly varying component mo...
research
07/11/2023

Mining for Unknown Unknowns

Unknown unknowns are future relevant contingencies that lack an ex ante ...
research
12/09/2021

GPU backed Data Mining on Android Devices

Choosing an appropriate programming paradigm for high-performance comput...

Please sign up or login with your details

Forgot password? Click here to reset