Collecting Telemetry Data Privately

12/05/2017
by   Bolin Ding, et al.
0

The collection and analysis of telemetry data from users' devices is routinely performed by many software companies. Telemetry collection leads to improved user experience but poses significant risks to users' privacy. Locally differentially private (LDP) algorithms have recently emerged as the main tool that allows data collectors to estimate various population statistics, while preserving privacy. The guarantees provided by such algorithms are typically very strong for a single round of telemetry collection, but degrade rapidly when telemetry is collected regularly. In particular, existing LDP algorithms are not suitable for repeated collection of counter data such as daily app usage statistics. In this paper, we develop new LDP mechanisms geared towards repeated collection of counter data, with formal privacy guarantees even after being executed for an arbitrarily long period of time. For two basic analytical tasks, mean estimation and histogram estimation, our LDP mechanisms for repeated data collection provide estimates with comparable or even the same accuracy as existing single-round LDP collection mechanisms. We conduct empirical evaluation on real-world counter datasets to verify our theoretical results. Our mechanisms have been deployed by Microsoft to collect telemetry across millions of devices.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/05/2019

Locally Differentially Private Data Collection and Analysis

Local differential privacy (LDP) can provide each user with strong priva...
research
02/24/2016

Discrete Distribution Estimation under Local Privacy

The collection and analysis of user data drives improvements in the app ...
research
07/15/2023

On the Utility Gain of Iterative Bayesian Update for Locally Differentially Private Mechanisms

This paper investigates the utility gain of using Iterative Bayesian Upd...
research
11/28/2019

PCKV: Locally Differentially Private Correlated Key-Value Data Collection with Optimized Utility

Data collection under local differential privacy (LDP) has been mostly s...
research
07/02/2021

Subset Privacy: Draw from an Obfuscated Urn

With the rapidly increasing ability to collect and analyze personal data...
research
06/17/2021

Interval Privacy: A Framework for Data Collection

The emerging public awareness and government regulations of data privacy...

Please sign up or login with your details

Forgot password? Click here to reset