Data Masking with Privacy Guarantees

01/08/2019
by   Anh T. Pham, et al.
0

We study the problem of data release with privacy, where data is made available with privacy guarantees while keeping the usability of the data as high as possible --- this is important in health-care and other domains with sensitive data. In particular, we propose a method of masking the private data with privacy guarantee while ensuring that a classifier trained on the masked data is similar to the classifier trained on the original data, to maintain usability. We analyze the theoretical risks of the proposed method and the traditional input perturbation method. Results show that the proposed method achieves lower risk compared to the input perturbation, especially when the number of training samples gets large. We illustrate the effectiveness of the proposed method of data masking for privacy-sensitive learning on 12 benchmark datasets.

READ FULL TEXT
research
06/22/2020

P3GM: Private High-Dimensional Data Release via Privacy Preserving Phased Generative Model

How can we release a massive volume of sensitive data while mitigating p...
research
11/03/2021

Privately Publishable Per-instance Privacy

We consider how to privately share the personalized privacy losses incur...
research
02/20/2020

Input Perturbation: A New Paradigm between Central and Local Differential Privacy

Traditionally, there are two models on differential privacy: the central...
research
07/30/2019

Privacy-preserving Distributed Machine Learning via Local Randomization and ADMM Perturbation

With the proliferation of training data, distributed machine learning (D...
research
10/24/2017

Synthetic Data for Social Good

Data for good implies unfettered access to data. But data owners must be...
research
01/16/2023

Enforcing Privacy in Distributed Learning with Performance Guarantees

We study the privatization of distributed learning and optimization stra...
research
04/21/2023

Power to the Data Defenders: Human-Centered Disclosure Risk Calibration of Open Data

The open data ecosystem is susceptible to vulnerabilities due to disclos...

Please sign up or login with your details

Forgot password? Click here to reset