Better Differentially Private Approximate Histograms and Heavy Hitters using the Misra-Gries Sketch

01/06/2023
by   Christian Janos Lebeda, et al.
0

We consider the problem of computing differentially private approximate histograms and heavy hitters in a stream of elements. In the non-private setting, this is often done using the sketch of Misra and Gries [Science of Computer Programming, 1982]. Chan, Li, Shi, and Xu [PETS 2012] describe a differentially private version of the Misra-Gries sketch, but the amount of noise it adds can be large and scales linearly with the size of the sketch: the more accurate the sketch is, the more noise this approach has to add. We present a better mechanism for releasing Misra-Gries sketch under (ε,δ)-differential privacy. It adds noise with magnitude independent of the size of the sketch size, in fact, the maximum error coming from the noise is the same as the best known in the private non-streaming setting, up to a constant factor. Our mechanism is simple and likely to be practical. We also give a simple post-processing step of the Misra-Gries sketch that does not increase the worst-case error guarantee. It is sufficient to add noise to this new sketch with less than twice the magnitude of the non-streaming setting. This improves on the previous result for ε-differential privacy where the noise scales linearly to the size of the sketch.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/15/2021

DPGen: Automated Program Synthesis for Differential Privacy

Differential privacy has become a de facto standard for releasing data i...
research
05/17/2022

Improved Utility Analysis of Private CountSketch

Sketching is an important tool for dealing with high-dimensional vectors...
research
01/31/2020

Efficient Differentially Private F_0 Linear Sketching

A powerful feature of linear sketches is that from sketches of two data ...
research
05/26/2021

Differentially Private Frequency Moments Estimation with Polylogarithmic Space

We prove that 𝔽_p sketch, a well-celebrated streaming algorithm for freq...
research
01/10/2022

Bounded Space Differentially Private Quantiles

Estimating the quantiles of a large dataset is a fundamental problem in ...
research
03/29/2022

(Nearly) All Cardinality Estimators Are Differentially Private

We consider privacy in the context of streaming algorithms for cardinali...
research
11/07/2022

Additive Noise Mechanisms for Making Randomized Approximation Algorithms Differentially Private

The exponential increase in the amount of available data makes taking ad...

Please sign up or login with your details

Forgot password? Click here to reset