Privacy-preserving parametric inference: a case for robust statistics

11/22/2019
by   Marco Avella-Medina, et al.
0

Differential privacy is a cryptographically-motivated approach to privacy that has become a very active field of research over the last decade in theoretical computer science and machine learning. In this paradigm one assumes there is a trusted curator who holds the data of individuals in a database and the goal of privacy is to simultaneously protect individual data while allowing the release of global characteristics of the database. In this setting we introduce a general framework for parametric inference with differential privacy guarantees. We first obtain differentially private estimators based on bounded influence M-estimators by leveraging their gross-error sensitivity in the calibration of a noise term added to them in order to ensure privacy. We then show how a similar construction can also be applied to construct differentially private test statistics analogous to the Wald, score and likelihood ratio tests. We provide statistical guarantees for all our proposals via an asymptotic analysis. An interesting consequence of our results is to further clarify the connection between differential privacy and robust statistics. In particular, we demonstrate that differential privacy is a weaker stability requirement than infinitesimal robustness, and show that robust M-estimators can be easily randomized in order to guarantee both differential privacy and robustness towards the presence of contaminated data. We illustrate our results both on simulated and real data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/27/2012

Convergence Rates for Differentially Private Statistical Estimation

Differential privacy is a cryptographically-motivated definition of priv...
research
06/25/2020

Identification and Formal Privacy Guarantees

Empirical economic research crucially relies on highly sensitive individ...
research
10/04/2017

Differentially Private Database Release via Kernel Mean Embeddings

We lay theoretical foundations for new database release mechanisms that ...
research
08/01/2023

Differentially Private Linear Regression with Linked Data

There has been increasing demand for establishing privacy-preserving met...
research
02/19/2023

Sample-efficient private data release for Lipschitz functions under sparsity assumptions

Differential privacy is the de facto standard for protecting privacy in ...
research
05/03/2019

Locally Differentially Private Naive Bayes Classification

In machine learning, classification models need to be trained in order t...
research
02/03/2023

From Robustness to Privacy and Back

We study the relationship between two desiderata of algorithms in statis...

Please sign up or login with your details

Forgot password? Click here to reset