A Distance Measure for Privacy-preserving Process Mining based on Feature Learning

07/14/2021
by   Fabian Rösel, et al.
0

To enable process analysis based on an event log without compromising the privacy of individuals involved in process execution, a log may be anonymized. Such anonymization strives to transform a log so that it satisfies provable privacy guarantees, while largely maintaining its utility for process analysis. Existing techniques perform anonymization using simple, syntactic measures to identify suitable transformation operations. This way, the semantics of the activities referenced by the events in a trace are neglected, potentially leading to transformations in which events of unrelated activities are merged. To avoid this and incorporate the semantics of activities during anonymization, we propose to instead incorporate a distance measure based on feature learning. Specifically, we show how embeddings of events enable the definition of a distance measure for traces to guide event log anonymization. Our experiments with real-world data indicate that anonymization using this measure, compared to a syntactic one, yields logs that are closer to the original log in various dimensions and, hence, have higher utility for process analysis.

READ FULL TEXT
research
09/17/2021

SaCoFa: Semantics-aware Control-flow Anonymization for Process Mining

Privacy-preserving process mining enables the analysis of business proce...
research
06/23/2020

PRIPEL: Privacy-Preserving Event Log Publishing Including Contextual Information

Event logs capture the execution of business processes in terms of execu...
research
06/27/2022

Libra: High-Utility Anonymization of Event Logs for Process Mining via Subsampling

Process mining techniques enable analysts to identify and assess process...
research
12/21/2020

Towards Quantifying Privacy in Process Mining

Process mining employs event logs to provide insights into the actual pr...
research
07/18/2020

An Entropic Relevance Measure for Stochastic Conformance Checking in Process Mining

Given an event log as a collection of recorded real-world process traces...
research
05/01/2023

PMDG: Privacy for Multi-Perspective Process Mining through Data Generalization

Anonymization of event logs facilitates process mining while protecting ...
research
03/22/2021

Privacy-aware Process Performance Indicators: Framework and Release Mechanisms

Process performance indicators (PPIs) are metrics to quantify the degree...

Please sign up or login with your details

Forgot password? Click here to reset