Accuracy Gains from Privacy Amplification Through Sampling for Differential Privacy

03/17/2021
by   Jingchen Hu, et al.
0

Recent research in differential privacy demonstrated that (sub)sampling can amplify the level of protection. For example, for ϵ-differential privacy and simple random sampling with sampling rate r, the actual privacy guarantee is approximately rϵ, if a value of ϵ is used to protect the output from the sample. In this paper, we study whether this amplification effect can be exploited systematically to improve the accuracy of the privatized estimate. Specifically, assuming the agency has information for the full population, we ask under which circumstances accuracy gains could be expected, if the privatized estimate would be computed on a random sample instead of the full population. We find that accuracy gains can be achieved for certain regimes. However, gains can typically only be expected, if the sensitivity of the output with respect to small changes in the database does not depend too strongly on the size of the database. We only focus on algorithms that achieve differential privacy by adding noise to the final output and illustrate the accuracy implications for two commonly used statistics: the mean and the median. We see our research as a first step towards understanding the conditions required for accuracy gains in practice and we hope that these findings will stimulate further research broadening the scope of differential privacy algorithms and outputs considered.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/11/2022

Achieving Differential Privacy with Matrix Masking in Big Data

Differential privacy schemes have been widely adopted in recent years to...
research
03/07/2022

Differential Privacy Amplification in Quantum and Quantum-inspired Algorithms

Differential privacy provides a theoretical framework for processing a d...
research
11/08/2021

Distribution-Invariant Differential Privacy

Differential privacy is becoming one gold standard for protecting the pr...
research
06/04/2020

Median regression with differential privacy

Median regression analysis has robustness properties which make it attra...
research
05/12/2023

Making Differential Privacy Work for Census Data Users

The U.S. Census Bureau collects and publishes detailed demographic data ...
research
10/04/2017

Differentially Private Database Release via Kernel Mean Embeddings

We lay theoretical foundations for new database release mechanisms that ...
research
01/24/2023

Database Reconstruction Is Not So Easy and Is Different from Reidentification

In recent years, it has been claimed that releasing accurate statistical...

Please sign up or login with your details

Forgot password? Click here to reset