HyObscure: Hybrid Obscuring for Privacy-Preserving Data Publishing

12/15/2021
by   Xiao Han, et al.
0

Minimizing privacy leakage while ensuring data utility is a critical problem to data holders in a privacy-preserving data publishing task. Most prior research concerns only with one type of data and resorts to a single obscuring method, , obfuscation or generalization, to achieve a privacy-utility tradeoff, which is inadequate for protecting real-life heterogeneous data and is hard to defend ever-growing machine learning based inference attacks. This work takes a pilot study on privacy-preserving data publishing when both generalization and obfuscation operations are employed for heterogeneous data protection. To this end, we first propose novel measures for privacy and utility quantification and formulate the hybrid privacy-preserving data obscuring problem to account for the joint effect of generalization and obfuscation. We then design a novel hybrid protection mechanism called HyObscure, to cross-iteratively optimize the generalization and obfuscation operations for maximum privacy protection under a certain utility guarantee. The convergence of the iterative process and the privacy leakage bound of HyObscure are also provided in theory. Extensive experiments demonstrate that HyObscure significantly outperforms a variety of state-of-the-art baseline methods when facing various inference attacks under different scenarios. HyObscure also scales linearly to the data size and behaves robustly with varying key parameters.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/08/2021

Equity and Privacy: More Than Just a Tradeoff

While the entire field of privacy preserving data analytics is focused o...
research
11/10/2019

Prospect Theoretic Analysis of Privacy-Preserving Mechanism

We study a problem of privacy-preserving mechanism design. A data collec...
research
02/09/2020

Target Privacy Preserving for Social Networks

In this paper, we incorporate the realistic scenario of key protection i...
research
11/22/2019

Adversarial Learning of Privacy-Preserving and Task-Oriented Representations

Data privacy has emerged as an important issue as data-driven deep learn...
research
08/25/2020

Privacy-Preserving Data Publishing via Mutual Cover

We study anonymization techniques for preserving privacy in the publicat...
research
12/04/2018

Hybrid Microaggregation for Privacy-Preserving Data Mining

k-Anonymity by microaggregation is one of the most commonly used anonymi...
research
06/08/2020

An operational architecture for privacy-by-design in public service applications

Governments around the world are trying to build large data registries f...

Please sign up or login with your details

Forgot password? Click here to reset