DP-PCA: Statistically Optimal and Differentially Private PCA

05/27/2022
by   Xiyang Liu, et al.
4

We study the canonical statistical task of computing the principal component from n i.i.d. data in d dimensions under (ε,δ)-differential privacy. Although extensively studied in literature, existing solutions fall short on two key aspects: (i) even for Gaussian data, existing private algorithms require the number of samples n to scale super-linearly with d, i.e., n=Ω(d^3/2), to obtain non-trivial results while non-private PCA requires only n=O(d), and (ii) existing techniques suffer from a non-vanishing error even when the randomness in each data point is arbitrarily small. We propose DP-PCA, which is a single-pass algorithm that overcomes both limitations. It is based on a private minibatch gradient ascent method that relies on private mean estimation, which adds minimal noise required to ensure privacy by adapting to the variance of a given minibatch of gradients. For sub-Gaussian data, we provide nearly optimal statistical error rates even for n=Õ(d). Furthermore, we provide a lower bound showing that sub-Gaussian style assumption is necessary in obtaining the optimal error rate.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/11/2022

(Nearly) Optimal Private Linear Regression via Adaptive Clipping

We study the problem of differentially private linear regression where e...
research
02/03/2023

From Robustness to Privacy and Back

We study the relationship between two desiderata of algorithms in statis...
research
07/12/2012

Near-Optimal Algorithms for Differentially-Private Principal Components

Principal components analysis (PCA) is a standard tool for identifying g...
research
06/27/2019

Differentially private sub-Gaussian location estimators

We tackle the problem of estimating a location parameter with differenti...
research
02/02/2023

Convergence of Gradient Descent with Linearly Correlated Noise and Applications to Differentially Private Learning

We study stochastic optimization with linearly correlated noise. Our stu...
research
06/26/2023

Optimal Differentially Private Learning with Public Data

Differential Privacy (DP) ensures that training a machine learning model...
research
03/07/2018

Revisiting differentially private linear regression: optimal and adaptive prediction & estimation in unbounded domain

We revisit the problem of linear regression under a differential privacy...

Please sign up or login with your details

Forgot password? Click here to reset