Comparison of machine learning models applied on anonymized data with different techniques

05/12/2023
by   Judith Sainz-Pardo Díaz, et al.
0

Anonymization techniques based on obfuscating the quasi-identifiers by means of value generalization hierarchies are widely used to achieve preset levels of privacy. To prevent different types of attacks against database privacy it is necessary to apply several anonymization techniques beyond the classical k-anonymity or ℓ-diversity. However, the application of these methods is directly connected to a reduction of their utility in prediction and decision making tasks. In this work we study four classical machine learning methods currently used for classification purposes in order to analyze the results as a function of the anonymization techniques applied and the parameters selected for each of them. The performance of these models is studied when varying the value of k for k-anonymity and additional tools such as ℓ-diversity, t-closeness and δ-disclosure privacy are also deployed on the well-known adult dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/25/2017

Towards Measuring Membership Privacy

Machine learning models are increasingly made available to the masses th...
research
02/27/2022

Attacks on Deidentification's Defenses

Quasi-identifier-based deidentification techniques (QI-deidentification)...
research
02/05/2022

Linear Model with Local Differential Privacy

Scientific collaborations benefit from collaborative learning of distrib...
research
08/16/2022

pyCANON: A Python library to check the level of anonymity of a dataset

Openly sharing data with sensitive attributes and privacy restrictions i...
research
03/14/2018

Machine learning-assisted virtual patching of web applications

Web applications are permanently being exposed to attacks that exploit t...
research
08/17/2022

On the Privacy Effect of Data Enhancement via the Lens of Memorization

Machine learning poses severe privacy concerns as it is shown that the l...
research
07/04/2018

Diversity in Machine Learning

Machine learning methods have achieved good performance and been widely ...

Please sign up or login with your details

Forgot password? Click here to reset