A Formal Privacy Framework for Partially Private Data

04/03/2022
by   Jeremy Seeman, et al.
0

Despite its many useful theoretical properties, differential privacy (DP) has one substantial blind spot: any release that non-trivially depends on confidential data without additional privacy-preserving randomization fails to satisfy DP. Such a restriction is rarely met in practice, as most data releases under DP are actually "partially private" data (PPD). This poses a significant barrier to accounting for privacy risk and utility under logistical constraints imposed on data curators, especially those working with official statistics. In this paper, we propose a privacy definition which accommodates PPD and prove it maintains similar properties to standard DP. We derive optimal transport-based mechanisms for releasing PPD that satisfy our definition and algorithms for valid statistical inference using PPD, demonstrating their improved performance over post-processing methods. Finally, we apply these methods to a case study on US Census and CDC PPD to investigate private COVID-19 infection rates. In doing so, we show how data curators can use our framework to overcome barriers to operationalizing formal privacy while providing more transparency and accountability to users.

READ FULL TEXT

page 18

page 21

page 23

research
05/27/2022

Auditing Differential Privacy in High Dimensions with the Kernel Quantum Rényi Divergence

Differential privacy (DP) is the de facto standard for private data rele...
research
05/06/2022

Statistical Data Privacy: A Song of Privacy and Utility

To quantify trade-offs between increasing demand for open data sharing a...
research
10/15/2021

The Privacy-preserving Padding Problem: Non-negative Mechanisms for Conservative Answers with Differential Privacy

Differentially private noise mechanisms commonly use symmetric noise dis...
research
09/11/2020

Intertwining Order Preserving Encryption and Differential Privacy

Ciphertexts of an order-preserving encryption (OPE) scheme preserve the ...
research
11/17/2017

On the Existence of Densities for Functional Data and their Link to Statistical Privacy

In statistical privacy (or statistical disclosure control) the goal is t...
research
08/09/2021

Canonical Noise Distributions and Private Hypothesis Tests

f-DP has recently been proposed as a generalization of classical definit...
research
10/28/2021

Privacy Preserving Inference on the Ratio of Two Gaussians Using (Weighted) Sums

The ratio of two Gaussians is useful in many contexts of statistical inf...

Please sign up or login with your details

Forgot password? Click here to reset