Local Differentially Private Frequency Estimation based on Learned Sketches

10/31/2022
by   Meifan Zhang, et al.

Sketches are widely used for frequency estimation of data with a large domain. However, sketch-based frequency estimation faces additional challenges when privacy must be considered. Local differential privacy (LDP) is a solution for frequency estimation on sensitive data that preserves privacy. LDP enables each user to perturb their data on the client side to protect privacy, but it also introduces errors into the frequency estimates. Hash collisions in the sketches make the estimates for low-frequency items even worse. In this paper, we propose a two-phase frequency estimation framework for data with a large domain based on an LDP learned sketch, which separates high-frequency and low-frequency items to avoid the errors caused by hash collisions. We prove theoretically that the proposed method satisfies LDP and is more accurate than state-of-the-art frequency estimation methods, including Apple-CMS, Apple-HCMS, and FLH. Experimental results verify the performance of our method.
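To illustrate how client-side perturbation protects privacy while adding error to frequency estimates, the following is a minimal sketch of k-ary (generalized) randomized response, a standard LDP baseline. It is not the learned-sketch method proposed in the paper, nor Apple-CMS, Apple-HCMS, or FLH. The function names `grr_perturb` and `grr_estimate`, the toy domain, and the parameter values are assumptions chosen for illustration only.

```python
import math
import random
from collections import Counter


def grr_perturb(value, domain, epsilon, rng):
    """Client side: keep the true value with probability p,
    otherwise report a different value chosen uniformly at random."""
    k = len(domain)
    p = math.exp(epsilon) / (math.exp(epsilon) + k - 1)
    if rng.random() < p:
        return value
    others = [v for v in domain if v != value]
    return rng.choice(others)


def grr_estimate(reports, domain, epsilon):
    """Server side: debias the observed counts to obtain
    unbiased (but noisy) frequency estimates."""
    k = len(domain)
    n = len(reports)
    e = math.exp(epsilon)
    p = e / (e + k - 1)    # probability of reporting the true value
    q = 1.0 / (e + k - 1)  # probability of reporting one specific other value
    counts = Counter(reports)
    return {v: (counts[v] - n * q) / (p - q) for v in domain}


if __name__ == "__main__":
    rng = random.Random(0)                       # fixed seed, illustration only
    domain = ["a", "b", "c", "d"]                # hypothetical small domain
    data = ["a"] * 700 + ["b"] * 200 + ["c"] * 100
    reports = [grr_perturb(x, domain, 1.0, rng) for x in data]
    print(grr_estimate(reports, domain, 1.0))
```

The estimates always sum to the number of reports, but the noise on each item grows with the domain size, so rare items can receive estimates dominated by perturbation error. This is one reason large domains are usually handled with hashing or sketches, which in turn introduce the collision errors the abstract describes.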


research
09/03/2022

LDP-FPMiner: FP-Tree Based Frequent Itemset Mining with Local Differential Privacy

Data aggregation in the setting of local differential privacy (LDP) guar...
research
06/21/2023

PrivSketch: A Private Sketch-based Frequency Estimation Protocol for Data Streams

Local differential privacy (LDP) has recently become a popular privacy-p...
research
12/07/2021

SpaceSaving^±: An Optimal Algorithm for Frequency Estimation and Frequent items in the Bounded Deletion Model

In this paper, we propose the first deterministic algorithms to solve th...
research
12/05/2018

Calibrate: Frequency Estimation and Heavy Hitter Identification with Local Differential Privacy via Incorporating Prior Knowledge

Estimating frequencies of certain items among a population is a basic st...
research
04/13/2021

Fair and Differentially Private Distributed Frequency Estimation

In order to remain competitive, Internet companies collect and analyse u...
research
09/12/2017

Data Sketches for Disaggregated Subset Sum and Frequent Item Estimation

We introduce and study a new data sketch for processing massive datasets...
research
04/10/2023

Differentially Private Numerical Vector Analyses in the Local and Shuffle Model

Numerical vector aggregation plays a crucial role in privacy-sensitive a...
