Differentially private low-dimensional representation of high-dimensional data

05/26/2023
by   Yiyun He, et al.
0

Differentially private synthetic data provide a powerful mechanism to enable data analysis while protecting sensitive information about individuals. However, when the data lie in a high-dimensional space, the accuracy of the synthetic data suffers from the curse of dimensionality. In this paper, we propose a differentially private algorithm to generate low-dimensional synthetic data efficiently from a high-dimensional dataset with a utility guarantee with respect to the Wasserstein distance. A key step of our algorithm is a private principal component analysis (PCA) procedure with a near-optimal accuracy bound that circumvents the curse of dimensionality. Different from the standard perturbation analysis using the Davis-Kahan theorem, our analysis of private PCA works without assuming the spectral gap for the sample covariance matrix.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/11/2023

Algorithmically Effective Differentially Private Synthetic Data

We present a highly effective algorithmic approach for generating ε-diff...
research
11/18/2015

Wishart Mechanism for Differentially Private Principal Components Analysis

We propose a new input perturbation mechanism for publishing a covarianc...
research
07/12/2012

Near-Optimal Algorithms for Differentially-Private Principal Components

Principal components analysis (PCA) is a standard tool for identifying g...
research
05/03/2022

Optimal minimization of the covariance loss

Let X be a random vector valued in ℝ^m such that X_2≤ 1 almost surely. F...
research
03/13/2020

A Wide Dataset of Ear Shapes and Pinna-Related Transfer Functions Generated by Random Ear Drawings

Head-related transfer functions (HRTFs) individualization is a key matte...
research
04/20/2023

DPAF: Image Synthesis via Differentially Private Aggregation in Forward Phase

Differentially private synthetic data is a promising alternative for sen...
research
01/21/2023

Statistical Theory of Differentially Private Marginal-based Data Synthesis Algorithms

Marginal-based methods achieve promising performance in the synthetic da...

Please sign up or login with your details

Forgot password? Click here to reset