What Averages Do Not Tell – Predicting Real Life Processes with Sequential Deep Learning

10/19/2021
by István Ketykó, et al.

Deep Learning has proven to be an effective tool for modeling sequential data, as shown by its success in Natural Language Processing, Computer Vision, and Signal Processing. Process Mining concerns discovering insights into business processes from their execution data, which are logged by supporting information systems. The logged data (event log) consists of event sequences (traces) that correspond to executions of a process. Many Deep Learning techniques have been successfully adapted for predictive Process Mining, which aims to predict process outcomes, remaining time, the next event, or even the suffix of running traces. Traces in Process Mining are multimodal sequences and are structured very differently from natural language sentences or images, so they may require a different approach to processing. So far, there has been little focus on these differences and the challenges they introduce. For suffix prediction, the most challenging of these tasks, the performance of Deep Learning models has been evaluated only on average measures and for a small number of real-life event logs. Comparing results between papers is difficult because pre-processing and evaluation strategies differ. Challenges that may be relevant are the skewness of the trace-length distribution and the skewness of the activity distribution in real-life event logs. We provide an end-to-end framework that makes it possible to compare the performance of seven state-of-the-art sequential architectures in common settings. Results show that sequence modeling still has a lot of room for improvement for the majority of the more complex datasets. Further research and insights are required to achieve consistent performance not just in average measures but also across all prefixes.
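
To illustrate why an average measure can hide weak spots when trace lengths are skewed, the following minimal sketch scores suffix predictions both overall and per prefix length. It is not the authors' framework: the function names, the record layout, and the toy data are hypothetical, and a plain Levenshtein similarity is used as a stand-in for whatever sequence similarity an evaluation actually adopts.

```python
from collections import defaultdict
from statistics import mean


def levenshtein(a, b):
    """Plain Levenshtein edit distance between two activity sequences."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def similarity(pred, true):
    """Normalized similarity in [0, 1]; 1.0 means identical suffixes."""
    longest = max(len(pred), len(true))
    return 1.0 if longest == 0 else 1.0 - levenshtein(pred, true) / longest


def evaluate(records):
    """records: iterable of (prefix_length, predicted_suffix, true_suffix)."""
    by_prefix = defaultdict(list)
    for prefix_length, pred, true in records:
        by_prefix[prefix_length].append(similarity(pred, true))
    # Average over all predictions, the kind of single number usually reported.
    overall = mean(s for scores in by_prefix.values() for s in scores)
    # The same scores broken down by prefix length.
    per_prefix = {k: mean(v) for k, v in sorted(by_prefix.items())}
    return overall, per_prefix


# Made-up toy data (not results from the paper): short prefixes dominate.
records = [
    (1, ["b", "c", "d"], ["b", "c", "d"]),
    (1, ["b", "c"], ["b", "c"]),
    (1, ["b"], ["b"]),
    (5, ["x", "y"], ["e", "f"]),
]
overall, per_prefix = evaluate(records)
print(overall)
print(per_prefix)
```

In this toy setting the frequent short prefixes are predicted perfectly while the single long prefix is predicted entirely wrong, so the overall average looks acceptable even though one prefix length fails completely. The per-prefix breakdown makes that visible.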
