PCKV: Locally Differentially Private Correlated Key-Value Data Collection with Optimized Utility

11/28/2019
by   Xiaolan Gu, et al.
0

Data collection under local differential privacy (LDP) has been mostly studied for homogeneous data. Real-world applications often involve a mixture of different data types such as key-value pairs, where the frequency of keys and mean of values under each key must be estimated simultaneously. For key-value data collection with LDP, it is challenging to achieve a good utility-privacy tradeoff since the data contains two dimensions and a user may possess multiple key-value pairs. There is also an inherent correlation between key and values which if not harnessed, will lead to poor utility. In this paper, we propose a locally differentially private key-value data collection framework that utilizes correlated perturbations to enhance utility. We instantiate our framework by two protocols PCKV-UE (based on Unary Encoding) and PCKV-GRR (based on Generalized Randomized Response), where we design an advanced Padding-and-Sampling mechanism and an improved mean estimator which is non-interactive. Due to our correlated key and value perturbation mechanisms, the composed privacy budget is shown to be less than that of independent perturbation of key and value, which enables us to further optimize the perturbation parameters via budget allocation. Experimental results on both synthetic and real-world datasets show that our proposed protocols achieve better utility for both frequency and mean estimations under the same LDP guarantees than state-of-the-art mechanisms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/11/2019

Conditional Analysis for Key-Value Data with Local Differential Privacy

Local differential privacy (LDP) has been deemed as the de facto measure...
research
06/05/2019

Locally Differentially Private Data Collection and Analysis

Local differential privacy (LDP) can provide each user with strong priva...
research
07/15/2023

On the Utility Gain of Iterative Bayesian Update for Locally Differentially Private Mechanisms

This paper investigates the utility gain of using Iterative Bayesian Upd...
research
10/25/2020

Differentially Private Weighted Sampling

Common datasets have the form of elements with keys (e.g., transactions ...
research
08/28/2023

Zip to Zip-it: Compression to Achieve Local Differential Privacy

Local differential privacy techniques for numerical data typically trans...
research
12/17/2022

Stateful Switch: Optimized Time Series Release with Local Differential Privacy

Time series data have numerous applications in big data analytics. Howev...
research
12/05/2017

Collecting Telemetry Data Privately

The collection and analysis of telemetry data from users' devices is rou...

Please sign up or login with your details

Forgot password? Click here to reset