Libra: High-Utility Anonymization of Event Logs for Process Mining via Subsampling

06/27/2022
by   Gamal Elkoumy, et al.
0

Process mining techniques enable analysts to identify and assess process improvement opportunities based on event logs. A common roadblock to process mining is that event logs may contain private information that cannot be used for analysis without consent. An approach to overcome this roadblock is to anonymize the event log so that no individual represented in the original log can be singled out based on the anonymized one. Differential privacy is an anonymization approach that provides this guarantee. A differentially private event log anonymization technique seeks to produce an anonymized log that is as similar as possible to the original one (high utility) while providing a required privacy guarantee. Existing event log anonymization techniques operate by injecting noise into the traces in the log (e.g., duplicating, perturbing, or filtering out some traces). Recent work on differential privacy has shown that a better privacy-utility tradeoff can be achieved by applying subsampling prior to noise injection. In other words, subsampling amplifies privacy. This paper proposes an event log anonymization approach called Libra that exploits this observation. Libra extracts multiple samples of traces from a log, independently injects noise, retains statistically relevant traces from each sample, and composes the samples to produce a differentially private log. An empirical evaluation shows that the proposed approach leads to a considerably higher utility for equivalent privacy guarantees relative to existing baselines.

READ FULL TEXT
research
03/22/2021

Mine Me but Don't Single Me Out: Differentially Private Event Logs for Process Mining

The applicability of process mining techniques hinges on the availabilit...
research
09/17/2021

SaCoFa: Semantics-aware Control-flow Anonymization for Process Mining

Privacy-preserving process mining enables the analysis of business proce...
research
12/02/2020

Privacy-Preserving Directly-Follows Graphs: Balancing Risk and Utility in Process Mining

Process mining techniques enable organizations to analyze business proce...
research
01/09/2022

Differentially Private Release of Event Logs for Process Mining

The applicability of process mining techniques hinges on the availabilit...
research
07/14/2021

A Distance Measure for Privacy-preserving Process Mining based on Feature Learning

To enable process analysis based on an event log without compromising th...
research
03/29/2023

TraVaG: Differentially Private Trace Variant Generation Using GANs

Process mining is rapidly growing in the industry. Consequently, privacy...
research
10/20/2022

TraVaS: Differentially Private Trace Variant Selection for Process Mining

In the area of industrial process mining, privacy-preserving event data ...

Please sign up or login with your details

Forgot password? Click here to reset