Compressive Privatization: Sparse Distribution Estimation under Locally Differentially Privacy

12/03/2020
by   Zhongzheng Xiong, et al.
0

We consider the problem of discrete distribution estimation under locally differential privacy. Distribution estimation is one of the most fundamental estimation problems, which is widely studied in both non-private and private settings. In the local model, private mechanisms with provably optimal sample complexity are known. However, they are optimal only in the worst-case sense; their sample complexity is proportional to the size of the entire universe, which could be huge in practice (e.g., all IP addresses). We show that as long as the target distribution is sparse or approximately sparse (e.g., highly skewed), the number of samples needed could be significantly reduced. The sample complexity of our new mechanism is characterized by the sparsity of the target distribution and only weakly depends on the size the universe. Our mechanism does privatization and dimensionality reduction simultaneously, and the sample complexity will only depend on the reduced dimensionality. The original distribution is then recovered using tools from compressive sensing. To complement our theoretical results, we conduct experimental studies, the results of which clearly demonstrate the advantages of our method and confirm our theoretical findings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/13/2018

Communication Efficient, Sample Optimal, Linear Time Locally Private Discrete Distribution Estimation

We consider discrete distribution estimation over k elements under ε-loc...
research
11/22/2019

Privately Learning Thresholds: Closing the Exponential Gap

We study the sample complexity of learning threshold functions under the...
research
07/26/2022

The Sample Complexity of Forecast Aggregation

We consider a Bayesian forecast aggregation model where n experts, after...
research
02/21/2020

Private Mean Estimation of Heavy-Tailed Distributions

We give new upper and lower bounds on the minimax sample complexity of d...
research
10/30/2020

Estimating Sparse Discrete Distributions Under Local Privacy and Communication Constraints

We consider the task of estimating sparse discrete distributions under l...
research
03/24/2023

The Exact Sample Complexity Gain from Invariances for Kernel Regression on Manifolds

In practice, encoding invariances into models helps sample complexity. I...
research
10/19/2020

On the Sample Complexity of Privately Learning Unbounded High-Dimensional Gaussians

We provide sample complexity upper bounds for agnostically learning mult...

Please sign up or login with your details

Forgot password? Click here to reset