An Uncertainty Principle is a Price of Privacy-Preserving Microdata

10/25/2021
by   John Abowd, et al.
0

Privacy-protected microdata are often the desired output of a differentially private algorithm since microdata is familiar and convenient for downstream users. However, there is a statistical price for this kind of convenience. We show that an uncertainty principle governs the trade-off between accuracy for a population of interest ("sum query") vs. accuracy for its component sub-populations ("point queries"). Compared to differentially private query answering systems that are not required to produce microdata, accuracy can degrade by a logarithmic factor. For example, in the case of pure differential privacy, without the microdata requirement, one can provide noisy answers to the sum query and all point queries while guaranteeing that each answer has squared error O(1/ϵ^2). With the microdata requirement, one must choose between allowing an additional log^2(d) factor (d is the number of point queries) for some point queries or allowing an extra O(d^2) factor for the sum query. We present lower bounds for pure, approximate, and concentrated differential privacy. We propose mitigation strategies and create a collection of benchmark datasets that can be used for public study of this problem.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/06/2014

Differentially Private Data Releasing for Smooth Queries with Synthetic Database Output

We consider accurately answering smooth queries while preserving differe...
research
11/30/2020

Optimizing Fitness-For-Use of Differentially Private Linear Queries

In practice, differentially private data releases are designed to suppor...
research
08/10/2018

Optimizing error of high-dimensional statistical queries under differential privacy

Differentially private algorithms for answering sets of predicate counti...
research
12/16/2020

On Avoiding the Union Bound When Answering Multiple Differentially Private Queries

In this work, we study the problem of answering k queries with (ϵ, δ)-di...
research
03/15/2021

A Central Limit Theorem for Differentially Private Query Answering

Perhaps the single most important use case for differential privacy is t...
research
04/19/2023

Sensitivity estimation for differentially private query processing

Differential privacy has become a popular privacy-preserving method in d...
research
06/30/2022

Imputation under Differential Privacy

The literature on differential privacy almost invariably assumes that th...

Please sign up or login with your details

Forgot password? Click here to reset