Process Mining for Python (PM4Py): Bridging the Gap Between Process- and Data Science

05/15/2019
by   Alessandro Berti, et al.
0

Process mining, i.e., a sub-field of data science focusing on the analysis of event data generated during the execution of (business) processes, has seen a tremendous change over the past two decades. Starting off in the early 2000's, with limited to no tool support, nowadays, several software tools, i.e., both open-source, e.g., ProM and Apromore, and commercial, e.g., Disco, Celonis, ProcessGold, etc., exist. The commercial process mining tools provide limited support for implementing custom algorithms. Moreover, both commercial and open-source process mining tools are often only accessible through a graphical user interface, which hampers their usage in large-scale experimental settings. Initiatives such as RapidProM provide process mining support in the scientific workflow-based data science suite RapidMiner. However, these offer limited to no support for algorithmic customization. In the light of the aforementioned, in this paper, we present a novel process mining library, i.e. Process Mining for Python (PM4Py) that aims to bridge this gap, providing integration with state-of-the-art data science libraries, e.g., pandas, numpy, scipy and scikit-learn. We provide a global overview of the architecture and functionality of PM4Py, accompanied by some representative examples of its usage.

READ FULL TEXT

page 1

page 3

research
04/11/2022

PM4Py-GPU: a High-Performance General-Purpose Library for Process Mining

Open-source process mining provides many algorithms for the analysis of ...
research
08/25/2022

Cloud Process Execution Engine: Architecture and Interfaces

Process Execution Engines are a vital part of Business Process Managemen...
research
07/28/2020

A Process Mining Software Comparison

www.processmining-software.com is a dedicated website for process mining...
research
04/10/2020

On Strong Scaling and Open Source Tools for Analyzing Atom Probe Tomography Data

Atom probe tomography (APT) has matured to a versatile nanoanalytical ch...
research
02/26/2021

GraphSense: A General-Purpose Cryptoasset Analytics Platform

There is currently an increasing demand for cryptoasset analysis tools a...
research
08/24/2022

A Survey of Open Source Automation Tools for Data Science Predictions

We present an expository overview of technical and cultural challenges t...
research
05/10/2021

A practical, effective calculation of gamma difference distributions with open data science tools

At present, there is still no officially accepted and extensively verifi...

Please sign up or login with your details

Forgot password? Click here to reset