Local Differentially Private Fuzzy Counting in Stream Data using Probabilistic Data Structure

08/10/2022
by Dinusha Vatsalan, et al.

Privacy-preserving estimation of item counts in streaming data has applications in several real-world scenarios, including word auto-correction and traffic management. Recent work, such as RAPPOR and Apple's count-mean sketch (CMS) algorithm, proposes privacy-preserving mechanisms for count estimation in large volumes of data using probabilistic data structures such as counting Bloom filters and CMS. However, these existing methods fall short of providing a sound solution for real-time streaming data applications. In this work, we propose a novel (local) differentially private mechanism that provides high utility for the streaming data count estimation problem with similar or even lower privacy budgets while providing: a) fuzzy counting, which reports counts of related or similar items (for instance, to account for typing errors and data variations), and b) improved querying efficiency, which reduces the response time for real-time querying of counts. We provide formal proofs of the privacy and utility guarantees and present an extensive experimental evaluation of our algorithm using real and synthetic English word datasets for both the exact and fuzzy counting scenarios. Our privacy-preserving mechanism substantially outperforms prior work, with lower querying time and significantly higher utility (accuracy of count estimation) under similar or lower privacy budgets, at the cost of communication overhead.
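To make the general setting concrete, the following is a minimal sketch of locally differentially private count estimation in the spirit of the RAPPOR-style baselines mentioned above. It is not the mechanism proposed in this paper, and it does not model fuzzy counting. It assumes a single hash function mapping each word to one of m buckets, symmetric randomized response with each bit kept with probability e^(ε/2) / (1 + e^(ε/2)), and it ignores hash collisions. All names (bucket, keep_probability, perturb, Aggregator) are hypothetical and chosen for illustration only.

```python
import hashlib
import math
import random


def bucket(word: str, m: int) -> int:
    """Map a word to one of m buckets with a deterministic hash (assumed: SHA-256)."""
    digest = hashlib.sha256(word.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % m


def keep_probability(epsilon: float) -> float:
    """Probability of reporting a bit unchanged.

    Two one-hot vectors differ in at most two bits, so spending epsilon/2
    per bit gives epsilon-local differential privacy for the whole report.
    """
    return math.exp(epsilon / 2) / (1 + math.exp(epsilon / 2))


def perturb(word: str, m: int, epsilon: float, rng: random.Random) -> list:
    """Client side: one-hot encode the word's bucket, then flip each bit
    independently with probability 1 - keep_probability(epsilon)."""
    p = keep_probability(epsilon)
    j = bucket(word, m)
    report = []
    for i in range(m):
        bit = 1 if i == j else 0
        if rng.random() >= p:  # flip with probability 1 - p
            bit = 1 - bit
        report.append(bit)
    return report


class Aggregator:
    """Server side: sums noisy reports and debiases per-bucket counts."""

    def __init__(self, m: int, epsilon: float):
        self.m = m
        self.p = keep_probability(epsilon)
        self.sums = [0] * m  # number of reports with each bit set
        self.n = 0           # total number of reports received

    def add(self, report: list) -> None:
        for i, bit in enumerate(report):
            self.sums[i] += bit
        self.n += 1

    def estimate(self, word: str) -> float:
        """Unbiased estimate of how many reports came from this word's bucket.

        E[sum_j] = n*q + true_j*(p - q) with q = 1 - p, so we invert that.
        Words that share a bucket are counted together (collisions ignored).
        """
        q = 1 - self.p
        observed = self.sums[bucket(word, self.m)]
        return (observed - self.n * q) / (self.p - q)
```

In this sketch each client would call perturb once per item and send the resulting bit vector, and the server would call add for every report and estimate at query time. The size of each report grows with m, which illustrates the kind of communication cost such schemes can incur.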

