Differential Privacy with Random Projections and Sign Random Projections

05/22/2023
by   Ping Li, et al.
0

In this paper, we develop a series of differential privacy (DP) algorithms from a family of random projections (RP), for general applications in machine learning, data mining, and information retrieval. Among the presented algorithms, iDP-SignRP is remarkably effective under the setting of “individual differential privacy” (iDP), based on sign random projections (SignRP). Also, DP-SignOPORP considerably improves existing algorithms in the literature under the standard DP setting, using “one permutation + one random projection” (OPORP), where OPORP is a variant of the celebrated count-sketch method with fixed-length binning and normalization. Without taking signs, among the DP-RP family, DP-OPORP achieves the best performance. The concept of iDP (individual differential privacy) is defined only on a particular dataset of interest. While iDP is not strictly DP, iDP might be useful in certain applications, such as releasing a dataset (including sharing embeddings across companies or countries). In our study, we find that iDP-SignRP is remarkably effective for search and machine learning applications, in that the utilities are exceptionally good even at a very small privacy parameter ϵ (e.g., ϵ<0.5).

READ FULL TEXT
research
11/04/2020

The Limits of Differential Privacy (and its Misuse in Data Release and Machine Learning)

Differential privacy (DP) is a neat privacy definition that can co-exist...
research
07/31/2018

Subsampled Rényi Differential Privacy and Analytical Moments Accountant

We study the problem of subsampling in differential privacy (DP), a ques...
research
11/02/2019

Relations among different privacy notions

We present a comprehensive view of the relations among several privacy n...
research
07/07/2023

Random Number Generators and Seeding for Differential Privacy

Differential Privacy (DP) relies on random numbers to preserve privacy, ...
research
06/13/2023

Differentially Private One Permutation Hashing and Bin-wise Consistent Weighted Sampling

Minwise hashing (MinHash) is a standard algorithm widely used in the ind...
research
09/06/2020

Randomness Concerns When Deploying Differential Privacy

The U.S. Census Bureau is using differential privacy (DP) to protect con...
research
02/07/2023

OPORP: One Permutation + One Random Projection

Consider two D-dimensional data vectors (e.g., embeddings): u, v. In man...

Please sign up or login with your details

Forgot password? Click here to reset