Sample-efficient private data release for Lipschitz functions under sparsity assumptions

02/19/2023
by   Konstantin Donhauser, et al.
0

Differential privacy is the de facto standard for protecting privacy in a variety of applications. One of the key challenges is private data release, which is particularly relevant in scenarios where limited information about the desired statistics is available beforehand. Recent work has presented a differentially private data release algorithm that achieves optimal rates of order n^-1/d, with n being the size of the dataset and d being the dimension, for the worst-case error over all Lipschitz continuous statistics. This type of guarantee is desirable in many practical applications, as for instance it ensures that clusters present in the data are preserved. However, due to the "slow" rates, it is often infeasible in practice unless the dimension of the data is small. We demonstrate that these rates can be significantly improved to n^-1/s when only guarantees over s-sparse Lipschitz continuous functions are required, or to n^-1/(s+1) when the data lies on an unknown s-dimensional subspace, disregarding logarithmic factors. We therefore obtain practically meaningful rates for moderate constants s which motivates future work on computationally efficient approximate algorithms for this problem.

READ FULL TEXT

page 14

page 15

research
06/02/2020

Designing Differentially Private Estimators in High Dimensions

We study differentially private mean estimation in a high-dimensional se...
research
11/22/2019

Privacy-preserving parametric inference: a case for robust statistics

Differential privacy is a cryptographically-motivated approach to privac...
research
12/10/2019

Classification under local differential privacy

We consider the binary classification problem in a setup that preserves ...
research
09/03/2022

Differential Privacy on Dynamic Data

A fundamental problem in differential privacy is to release a privatized...
research
02/24/2021

No-Regret Algorithms for Private Gaussian Process Bandit Optimization

The widespread proliferation of data-driven decision-making has ushered ...
research
03/24/2021

U.S. Broadband Coverage Data Set: A Differentially Private Data Release

Broadband connectivity is a key metric in today's economy. In an era of ...

Please sign up or login with your details

Forgot password? Click here to reset