Sketch-Flip-Merge: Mergeable Sketches for Private Distinct Counting

02/04/2023
by   Jonathan Hehir, et al.
0

Data sketching is a critical tool for distinct counting, enabling multisets to be represented by compact summaries that admit fast cardinality estimates. Because sketches may be merged to summarize multiset unions, they are a basic building block in data warehouses. Although many practical sketches for cardinality estimation exist, none provide privacy when merging. We propose the first practical cardinality sketches that are simultaneously mergeable, differentially private (DP), and have low empirical errors. These introduce a novel randomized algorithm for performing logical operations on noisy bits, a tight privacy analysis, and provably optimal estimation. Our sketches dramatically outperform existing theoretical solutions in simulations and on real-world data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/04/2023

An Effective and Differentially Private Protocol for Secure Distributed Cardinality Estimation

Counting the number of distinct elements distributed over multiple data ...
research
03/29/2022

(Nearly) All Cardinality Estimators Are Differentially Private

We consider privacy in the context of streaming algorithms for cardinali...
research
05/26/2021

Differentially Private Frequency Moments Estimation with Polylogarithmic Space

We prove that 𝔽_p sketch, a well-celebrated streaming algorithm for freq...
research
08/18/2020

Differentially Private Sketches for Jaccard Similarity Estimation

This paper describes two locally-differential private algorithms for rel...
research
01/09/2023

Privacy-Preserving Record Linkage for Cardinality Counting

Several applications require counting the number of distinct items in th...
research
05/03/2022

Differentially Private Triangle and 4-Cycle Counting in the Shuffle Model

Subgraph counting is fundamental for analyzing connection patterns or cl...
research
08/17/2018

Cardinality Estimators do not Preserve Privacy

Cardinality estimators like HyperLogLog are sketching algorithms that es...

Please sign up or login with your details

Forgot password? Click here to reset