Privacy-Utility Trade-off of Linear Regression under Random Projections and Additive Noise

02/13/2019
by Mehrdad Showkatbakhsh, et al.

Data privacy is an important concern in machine learning, and it is fundamentally at odds with the task of training useful learning models, which typically requires acquiring large amounts of private user data. One way to accomplish the learning task while preserving user privacy is to train the model on a transformed, noisy version of the data, so that the data itself is never revealed directly to the training procedure. In this work, we analyze the privacy-utility trade-off of two such schemes for the problem of linear regression: additive noise and random projections. In contrast to previous work, we consider a recently proposed notion of differential privacy based on conditional mutual information (MI-DP), which is stronger than the conventional (ϵ, δ)-differential privacy, and we use relative objective error as the utility metric. We find that projecting the data to a lower-dimensional subspace before adding noise attains a better trade-off in general. We also draw a connection between the privacy problem and (non-coherent) SIMO, which has been extensively studied in wireless communication, and use tools from that literature for the analysis. We present numerical results demonstrating the performance of the schemes.
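
The two schemes named in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation, and several details are assumptions because the abstract does not specify them: the projection is taken to be a Gaussian matrix `S` applied across the sample dimension, the noise is i.i.d. Gaussian with a hypothetical scale `sigma`, and relative objective error is taken to be (f(θ̂) − f(θ*)) / f(θ*), where f is the least-squares objective on the clean data. The function names are illustrative only, and the sketch makes no claim about the privacy level either scheme achieves.

```python
import numpy as np


def relative_objective_error(X, y, theta):
    """Assumed utility metric: (f(theta) - f(theta*)) / f(theta*) on the clean data."""
    theta_star, *_ = np.linalg.lstsq(X, y, rcond=None)

    def f(t):
        return np.sum((X @ t - y) ** 2)

    return (f(theta) - f(theta_star)) / f(theta_star)


def additive_noise(X, y, sigma, rng):
    """Scheme 1: release [X, y] plus i.i.d. Gaussian noise (assumed noise model)."""
    A = np.hstack([X, y[:, None]])
    A_noisy = A + sigma * rng.standard_normal(A.shape)
    return A_noisy[:, :-1], A_noisy[:, -1]


def project_then_noise(X, y, k, sigma, rng):
    """Scheme 2: project the n rows down to k with a Gaussian S, then add noise.

    Applying S across the sample dimension is an assumption of this sketch.
    """
    n = X.shape[0]
    S = rng.standard_normal((k, n)) / np.sqrt(k)
    A = S @ np.hstack([X, y[:, None]])
    A_noisy = A + sigma * rng.standard_normal(A.shape)
    return A_noisy[:, :-1], A_noisy[:, -1]


# Synthetic regression problem (all sizes and scales are illustrative).
rng = np.random.default_rng(0)
n, d, k, sigma = 500, 5, 100, 0.5
X = rng.standard_normal((n, d))
theta_true = rng.standard_normal(d)
y = X @ theta_true + 0.1 * rng.standard_normal(n)

# Train on the transformed data, then evaluate on the clean data.
Xn, yn = additive_noise(X, y, sigma, rng)
theta_noise, *_ = np.linalg.lstsq(Xn, yn, rcond=None)
err_noise = relative_objective_error(X, y, theta_noise)

Xp, yp = project_then_noise(X, y, k, sigma, rng)
theta_proj, *_ = np.linalg.lstsq(Xp, yp, rcond=None)
err_proj = relative_objective_error(X, y, theta_proj)

print("relative objective error, additive noise:", err_noise)
print("relative objective error, projection + noise:", err_proj)
```

Comparing the two errors at a matched noise scale says nothing about the trade-off by itself, since the schemes provide different privacy levels for the same `sigma`; a fair comparison would calibrate the noise of each scheme to the same privacy guarantee, which this sketch does not attempt.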


