Secure and Utility-Aware Data Collection with Condensed Local Differential Privacy

05/15/2019
by   Mehmet Emre Gursoy, et al.

Local Differential Privacy (LDP) is widely used in practice for privacy-preserving data collection. Although existing LDP protocols offer high data utility for large user populations (100,000 or more users), they perform poorly in scenarios with small user populations (such as those in the cybersecurity domain). They also lack perturbation mechanisms that are effective for both ordinal and non-ordinal item sequences while protecting sequence length and content simultaneously. In this paper, we address the small user population problem by introducing the concept of Condensed Local Differential Privacy (CLDP) as a specialization of LDP, and we develop a suite of CLDP protocols that offer desirable statistical utility while preserving privacy. Our protocols support different types of client data, ranging from ordinal data types in finite metric spaces (e.g., numeric malware infection statistics), to non-ordinal items (e.g., OS versions, transaction categories), to sequences of ordinal and non-ordinal items. Extensive experiments on multiple datasets, including datasets an order of magnitude smaller than those used in existing approaches, show that the proposed CLDP protocols yield higher utility than existing LDP protocols. Furthermore, case studies with Symantec datasets demonstrate that our protocols outperform existing protocols in key cybersecurity-focused tasks: detecting ransomware outbreaks, identifying targeted and vulnerable OSs, and inspecting suspicious activities on infected machines.
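
To make the small-population problem concrete, the following is a minimal sketch of generalized randomized response (GRR), a standard LDP frequency-estimation protocol. It is not one of the CLDP protocols from the paper. The function names, the item domain, and the parameter values are illustrative assumptions, chosen only to show how the estimation error of a typical LDP protocol grows as the number of users shrinks.

```python
import math
import random
from collections import Counter


def grr_perturb(value, domain, epsilon, rng):
    """Client side: keep the true value with probability p,
    otherwise report a uniformly random *other* value from the domain."""
    k = len(domain)
    p = math.exp(epsilon) / (math.exp(epsilon) + k - 1)
    if rng.random() < p:
        return value
    others = [v for v in domain if v != value]
    return rng.choice(others)


def grr_estimate(reports, domain, epsilon):
    """Server side: unbiased frequency estimates from the perturbed reports."""
    k = len(domain)
    n = len(reports)
    e = math.exp(epsilon)
    p = e / (e + k - 1)      # probability of reporting the true value
    q = 1.0 / (e + k - 1)    # probability of reporting one specific other value
    counts = Counter(reports)
    return {v: (counts[v] / n - q) / (p - q) for v in domain}


def mean_abs_error(n, domain, epsilon, seed):
    """Simulate n users with uniformly random true values (an assumption
    made for illustration) and return the mean absolute estimation error."""
    rng = random.Random(seed)
    true_values = [rng.choice(domain) for _ in range(n)]
    true_freq = Counter(true_values)
    reports = [grr_perturb(v, domain, epsilon, rng) for v in true_values]
    est = grr_estimate(reports, domain, epsilon)
    return sum(abs(est[v] - true_freq[v] / n) for v in domain) / len(domain)


if __name__ == "__main__":
    domain = list(range(10))  # hypothetical item domain, e.g. 10 OS versions
    for n in (500, 5_000, 50_000):
        err = mean_abs_error(n, domain, epsilon=1.0, seed=0)
        print(f"n={n:>6}  mean absolute error={err:.4f}")
```

Under these assumptions the error scales roughly with 1/sqrt(n), so a population of a few hundred users yields far noisier estimates than one of tens of thousands, which is the gap that motivates the work above.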

research
07/11/2019

Conditional Analysis for Key-Value Data with Local Differential Privacy

Local differential privacy (LDP) has been deemed as the de facto measure...
research
10/01/2022

Frequency Estimation of Evolving Data Under Local Differential Privacy

Collecting and analyzing evolving longitudinal data has become a common ...
research
09/29/2020

DUMP: A Dummy-Point-Based Framework for Histogram Estimation in Shuffle Model

In Central Differential Privacy (CDP), there is a trusted analyst who co...
research
11/05/2019

Data Poisoning Attacks to Local Differential Privacy Protocols

Local Differential Privacy (LDP) protocols enable an untrusted data coll...
research
06/21/2023

PrivSketch: A Private Sketch-based Frequency Estimation Protocol for Data Streams

Local differential privacy (LDP) has recently become a popular privacy-p...
research
11/22/2021

Poisoning Attacks to Local Differential Privacy Protocols for Key-Value Data

Local Differential Privacy (LDP) protocols enable an untrusted server to...
research
09/04/2022

On the Risks of Collecting Multidimensional Data Under Local Differential Privacy

The private collection of multiple statistics from a population is a fun...
