Controlling Privacy Loss in Survey Sampling (Working Paper)

07/24/2020
by   Mark Bun, et al.
0

Social science and economics research is often based on data collected in surveys. Due to time and budgetary constraints, this data is often collected using complex sampling schemes designed to increase accuracy while reducing the costs of data collection. A commonly held belief is that the sampling process affords the data subjects some additional privacy. This intuition has been formalized in the differential privacy literature for simple random sampling: a differentially private mechanism run on a simple random subsample of a population provides higher privacy guarantees than when run on the entire population. In this work we initiate the study of the privacy implications of more complicated sampling schemes including cluster sampling and stratified sampling. We find that not only do these schemes often not amplify privacy, but that they can result in privacy degradation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/04/2018

Privacy Amplification by Subsampling: Tight Analyses via Couplings and Divergences

Differential privacy comes equipped with multiple analytical tools for t...
research
10/27/2016

Differentially Private Variational Inference for Non-conjugate Models

Many machine learning applications are based on data collected from peop...
research
12/23/2020

Hiding Among the Clones: A Simple and Nearly Optimal Analysis of Privacy Amplification by Shuffling

Recent work of Erlingsson, Feldman, Mironov, Raghunathan, Talwar, and Th...
research
01/22/2021

New randomized response technique for estimating the population total of a quantitative variable

In this paper, a new randomized response technique aimed at protecting r...
research
06/24/2018

On The Differential Privacy of Thompson Sampling With Gaussian Prior

We show that Thompson Sampling with Gaussian Prior as detailed by Algori...
research
10/25/2020

Differentially Private Weighted Sampling

Common datasets have the form of elements with keys (e.g., transactions ...
research
08/07/2017

Reallocating and Resampling: A Comparison for Inference

Simulation-based inference plays a major role in modern statistics, and ...

Please sign up or login with your details

Forgot password? Click here to reset