Differential privacy and robust statistics in high dimensions

11/12/2021
by Xiyang Liu, et al.

We introduce a universal framework for characterizing the statistical efficiency of a statistical estimation problem with differential privacy guarantees. Our framework, which we call High-dimensional Propose-Test-Release (HPTR), builds upon three crucial components: the exponential mechanism, robust statistics, and the Propose-Test-Release mechanism. Gluing all these together is the concept of resilience, which is central to robust statistical estimation. Resilience guides the design of the algorithm, the sensitivity analysis, and the success probability analysis of the test step in Propose-Test-Release. The key insight is that if we design an exponential mechanism that accesses the data only via one-dimensional robust statistics, then the resulting local sensitivity can be dramatically reduced. Using resilience, we can provide tight local sensitivity bounds. These tight bounds readily translate into near-optimal utility guarantees in several cases. We give a general recipe for applying HPTR to a given instance of a statistical estimation problem and demonstrate it on canonical problems of mean estimation, linear regression, covariance estimation, and principal component analysis. We introduce a general utility analysis technique that proves that HPTR nearly achieves the optimal sample complexity under several scenarios studied in the literature.
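As a rough illustration of the key insight, the sketch below runs the textbook exponential mechanism over a one-dimensional robust statistic, the median. It is not the HPTR algorithm: it has no test step and no resilience-based sensitivity analysis. The function names, the candidate grid, the synthetic data, and the choice of epsilon are all assumptions made for this example. The score is the rank distance of a candidate from the middle of the sample, which changes by at most 1 when a single record is replaced, so a sensitivity of 1 is used.

```python
import math
import random


def rank_score(data, candidate):
    """Score a candidate median: minus the distance of its rank from n/2."""
    below = sum(1 for x in data if x <= candidate)
    return -abs(below - len(data) / 2)


def exponential_mechanism(data, candidates, score, sensitivity, epsilon, rng):
    """Sample a candidate with probability proportional to
    exp(epsilon * score / (2 * sensitivity))."""
    scores = [score(data, c) for c in candidates]
    top = max(scores)  # subtract the max score for numerical stability
    weights = [math.exp(epsilon * (s - top) / (2 * sensitivity)) for s in scores]
    return rng.choices(candidates, weights=weights, k=1)[0]


# Hypothetical data: 1000 inliers around 5.0 plus 20 gross outliers.
rng = random.Random(0)
data = [rng.gauss(5.0, 1.0) for _ in range(1000)] + [1e6] * 20

# Assumed candidate grid on [0, 10] with step 0.1.
candidates = [i / 10 for i in range(0, 101)]

# Replacing one record moves any rank by at most 1, hence sensitivity 1.
estimate = exponential_mechanism(data, candidates, rank_score, 1.0, 1.0, rng)
print(estimate)
```

Because the score depends on the data only through ranks, the 20 outliers shift the preferred candidate by a few ranks rather than by their magnitude, which is the kind of robustness that keeps the sensitivity small.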

research
12/07/2020

Local Dampening: Differential Privacy for Non-numeric Queries via Local Sensitivity

Differential privacy is the state-of-the-art formal definition for data ...
research
11/19/2019

The Power of Factorization Mechanisms in Local and Central Differential Privacy

We give new characterizations of the sample complexity of answering line...
research
01/22/2021

The Privacy-Utility Tradeoff of Robust Local Differential Privacy

We consider data release protocols for data X=(S,U), where S is sensitiv...
research
01/30/2019

Benefits and Pitfalls of the Exponential Mechanism with Applications to Hilbert Spaces and Functional PCA

The exponential mechanism is a fundamental tool of Differential Privacy ...
research
02/18/2021

Robust and Differentially Private Mean Estimation

Differential privacy has emerged as a standard requirement in a variety ...
research
02/12/2019

The Cost of Privacy: Optimal Rates of Convergence for Parameter Estimation with Differential Privacy

Privacy-preserving data analysis is a rising challenge in contemporary s...
research
04/23/2018

Individual Sensitivity Preprocessing for Data Privacy

The sensitivity metric in differential privacy, which is informally defi...