Survey on Privacy-Preserving Techniques for Data Publishing

01/20/2022
by   Tânia Carvalho, et al.
0

The exponential growth of collected, processed, and shared microdata has given rise to concerns about individuals' privacy. As a result, laws and regulations have emerged to control what organisations do with microdata and how they protect it. Statistical Disclosure Control seeks to reduce the risk of confidential information disclosure by de-identifying them. Such de-identification is guaranteed through privacy-preserving techniques. However, de-identified data usually results in loss of information, with a possible impact on data analysis precision and model predictive performance. The main goal is to protect the individuals' privacy while maintaining the interpretability of the data, i.e. its usefulness. Statistical Disclosure Control is an area that is expanding and needs to be explored since there is still no solution that guarantees optimal privacy and utility. This survey focuses on all steps of the de-identification process. We present existing privacy-preserving techniques used in microdata de-identification, privacy measures suitable for several disclosure types and, information loss and predictive performance measures. In this survey, we discuss the main challenges raised by privacy constraints, describe the main approaches to handle these obstacles, review taxonomies of privacy-preserving techniques, provide a theoretical analysis of existing comparative studies, and raise multiple open issues.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/13/2022

Towards a Data Privacy-Predictive Performance Trade-off

Machine learning is increasingly used in the most diverse applications a...
research
01/25/2021

Privacy Preserving Techniques Applied to CPNI Data: Analysis and Recommendations

With mobile phone penetration rates reaching 90 Network Information (CPN...
research
03/16/2021

No Intruder, no Validity: Evaluation Criteria for Privacy-Preserving Text Anonymization

For sensitive text data to be shared among NLP researchers and practitio...
research
12/01/2022

Privacy-Preserving Data Synthetisation for Secure Information Sharing

We can protect user data privacy via many approaches, such as statistica...
research
08/13/2018

Review of Different Privacy Preserving Techniques in PPDP

Big data is a term used for a very large data sets that have many diffic...
research
04/10/2023

Privacy-preserving Inference of Group Mean Difference in Zero-inflated Right Skewed Data with Partitioning and Censoring

We examine privacy-preserving inferences of group mean differences in ze...

Please sign up or login with your details

Forgot password? Click here to reset