An empirical investigation of different classifiers, encoding and ensemble schemes for next event prediction using business process event logs

08/24/2020
by Bayu Adhi Tama, et al.

There is a growing need for empirical benchmarks that help researchers and practitioners select the best machine learning technique for a given prediction task. In this paper, we consider the next event prediction task in business process predictive monitoring, and we extend our previously published benchmark by studying how different encoding windows and ensemble schemes affect performance. The choice of whether to use ensembles, and which scheme to use, often depends on the type of data and the classification task. While there is a general understanding that ensembles perform well in predictive monitoring of business processes, no other benchmarks involving ensembles are available for next event prediction. The proposed benchmark helps researchers select a high-performing individual classifier or ensemble scheme given the case-level variability of the event log under consideration. Experimental results show that choosing an optimal number of events for feature encoding is challenging, so each event log must be considered individually when selecting this value. Ensemble schemes improve the performance of low-performing classifiers in this task, such as SVM, whereas high-performing classifiers, such as tree-based classifiers, do not benefit from ensemble schemes.
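To make the notion of an encoding window concrete, the following is a minimal sketch, not the encoding used in the paper, whose details the abstract does not give. It assumes that a case is an ordered list of activity labels and that each training sample consists of the last `window` events of a prefix, with the event that follows as the label. The function name `encode_windows` and the padding symbol `PAD` are hypothetical and introduced only for illustration.

```python
from typing import Hashable, List, Sequence, Tuple

# Hypothetical padding symbol for prefixes shorter than the window.
PAD = "<pad>"


def encode_windows(
    trace: Sequence[Hashable], window: int
) -> List[Tuple[Tuple[Hashable, ...], Hashable]]:
    """Turn one case into (features, next_event) pairs.

    Assumption: features are the last `window` activity labels of each
    prefix, left-padded with PAD when the prefix is shorter.
    """
    if window < 1:
        raise ValueError("window must be at least 1")
    samples = []
    # Every position after the first event yields one prediction target.
    for i in range(1, len(trace)):
        prefix = tuple(trace[max(0, i - window):i])
        padded = (PAD,) * (window - len(prefix)) + prefix
        samples.append((padded, trace[i]))
    return samples


# Toy case with made-up activity names.
trace = ["register", "check", "approve", "pay"]
samples = encode_windows(trace, window=2)
for features, next_event in samples:
    print(features, "->", next_event)
```

Under these assumptions, changing `window` changes how much history each sample carries, which is the parameter the abstract reports as needing to be chosen per event log. The resulting pairs could then be passed to any individual classifier or ensemble scheme after a categorical encoding step.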

