Robust and Differentially Private Mean Estimation

02/18/2021
by Xiyang Liu, et al.

Differential privacy has emerged as a standard requirement in a variety of applications, ranging from the U.S. Census to data collected on commercial devices, initiating an extensive line of research on accurately and privately releasing statistics of a database. An increasing number of such databases consist of data from multiple sources, not all of which can be trusted. This leaves existing private analyses vulnerable to attacks by an adversary who injects corrupted data. Despite the significance of designing algorithms that simultaneously guarantee privacy and robustness (to a fraction of the data being corrupted), even the simplest questions remain open. For the canonical problem of estimating the mean from i.i.d. samples, we introduce the first efficient algorithm that achieves both privacy and robustness for a wide range of distributions. The algorithm achieves optimal accuracy, matching the known lower bounds for robustness, but its sample complexity has a gap of a factor of d^{1/2} from the known lower bounds. We further show that this gap is due to computational efficiency: we introduce the first family of algorithms that closes this gap but takes exponential time. The innovation is in exploiting resilience (a key property in robust estimation) to adaptively bound the sensitivity and improve privacy.
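To make the role of sensitivity concrete, here is a minimal sketch of a generic baseline: clip every sample to a ball, average, and add Gaussian noise. This is not the algorithm from the paper, which bounds the sensitivity adaptively through resilience. The sketch instead assumes a fixed `center` and `radius` are given in advance. All function and parameter names are hypothetical, and the noise calibration assumes the classical Gaussian mechanism with `epsilon < 1` and neighboring datasets that differ by replacing one sample.

```python
import math
import random


def clip_to_ball(x, center, radius):
    """Project x onto the L2 ball of the given radius around center."""
    diff = [xi - ci for xi, ci in zip(x, center)]
    norm = math.sqrt(sum(v * v for v in diff))
    if norm <= radius:
        return list(x)
    scale = radius / norm
    return [ci + v * scale for ci, v in zip(center, diff)]


def private_clipped_mean(samples, center, radius, epsilon, delta, rng):
    """Clipped mean plus Gaussian noise (illustrative baseline only).

    Assumptions: center and radius are fixed, data-independent inputs;
    epsilon < 1; neighbors differ by replacing a single sample.
    """
    n = len(samples)
    d = len(center)
    clipped = [clip_to_ball(x, center, radius) for x in samples]
    mean = [sum(x[j] for x in clipped) / n for j in range(d)]

    # Replacing one clipped sample moves the mean by at most 2 * radius / n
    # in L2 norm, so this is the sensitivity the noise is calibrated to.
    sensitivity = 2.0 * radius / n
    sigma = sensitivity * math.sqrt(2.0 * math.log(1.25 / delta)) / epsilon
    return [m + rng.gauss(0.0, sigma) for m in mean]


if __name__ == "__main__":
    rng = random.Random(0)
    # 1000 inliers at (1, 1) plus one injected outlier far away.
    data = [[1.0, 1.0] for _ in range(1000)] + [[1e6, 1e6]]
    print(private_clipped_mean(data, [0.0, 0.0], 5.0, 0.5, 1e-5, rng))
```

Clipping limits how far a single corrupted or changed sample can move the average, which is why the same step helps with both robustness and privacy. A fixed radius is a crude choice: set too large, it inflates the noise, and set too small, it biases the estimate.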


research
02/21/2020

Private Mean Estimation of Heavy-Tailed Distributions

We give new upper and lower bounds on the minimax sample complexity of d...
research
12/09/2022

Robustness Implies Privacy in Statistical Estimation

We study the relationship between adversarial robustness and differentia...
research
11/01/2022

Privacy Induces Robustness: Information-Computation Gaps and Sparse Mean Estimation

We establish a simple connection between robust and differentially-priva...
research
01/30/2023

Near Optimal Private and Robust Linear Regression

We study the canonical statistical estimation problem of linear regressi...
research
10/17/2020

Locally Differentially Private Analysis of Graph Statistics

Differentially private analysis of graphs is widely used for releasing s...
research
11/12/2021

Differential privacy and robust statistics in high dimensions

We introduce a universal framework for characterizing the statistical ef...
research
01/29/2019

Robust Learning from Untrusted Sources

Modern machine learning methods often require more data for training tha...
