Federated Heavy Hitters Discovery with Differential Privacy

02/22/2019
by   Wennan Zhu, et al.
0

The discovery of heavy hitters (most frequent items) in user-generated data streams drives improvements in the app and web ecosystems, but often comes with substantial privacy risks. To address these risks, we propose a distributed and privacy-preserving algorithm for discovering the heavy hitters in a population of user-generated data streams. We critically leverage the sampling property of our distributed algorithm to prove that it is inherently differentially private, without requiring any addition of noise. We also examine the trade-off between privacy and utility, and show that our algorithm provides excellent utility while achieving strong privacy guarantees. We validate our findings both theoretically, using worst-case analyses, and practically, using a Twitter dataset with 1.6M tweets and over 650k users.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/26/2019

Automatic Discovery of Privacy-Utility Pareto Fronts

Differential privacy is a mathematical framework for privacy-preserving ...
research
07/05/2023

Privacy-Preserving Federated Heavy Hitter Analytics for Non-IID Data

Federated heavy-hitter analytics involves the identification of the most...
research
04/18/2022

PrivateRec: Differentially Private Training and Serving for Federated News Recommendation

Privacy protection is an essential issue in personalized news recommenda...
research
07/21/2023

Differentially Private Heavy Hitter Detection using Federated Analytics

In this work, we study practical heuristics to improve the performance o...
research
09/21/2023

Privacy-Preserving In-Context Learning with Differentially Private Few-Shot Generation

We study the problem of in-context learning (ICL) with large language mo...
research
12/10/2021

Sample and Threshold Differential Privacy: Histograms and applications

Federated analytics relies on the collection of accurate statistics abou...
research
10/20/2019

Leveraging Hierarchical Representations for Preserving Privacy and Utility in Text

Guaranteeing a certain level of user privacy in an arbitrary piece of te...

Please sign up or login with your details

Forgot password? Click here to reset