Differentially Private Numerical Vector Analyses in the Local and Shuffle Model

04/10/2023
by   Shaowei Wang, et al.

Numerical vector aggregation plays a crucial role in privacy-sensitive applications, such as distributed gradient estimation in federated learning and statistical analysis of key-value data. In the context of local differential privacy, this study provides a tight minimax error bound of O(ds/(nϵ^2)), where d is the dimension of the numerical vector, s is the number of non-zero entries, n is the number of users, and ϵ is the privacy budget. By converting the conditional/unconditional numerical mean estimation problem into a frequency estimation problem, we develop an optimal and efficient mechanism called Collision. In contrast, existing methods exhibit sub-optimal error rates of O(d^2/(nϵ^2)) or O(ds^2/(nϵ^2)). For unconditional mean estimation specifically, we leverage the negative correlation between the two frequencies in each dimension and propose the CoCo mechanism, which further reduces the estimation error of mean values compared to Collision. Moreover, to surpass the error barrier of local privacy, we analyze privacy amplification in the shuffle model for the proposed mechanisms and derive tight amplification bounds. Our experiments compare our mechanisms with existing approaches and demonstrate significant error reductions for both frequency estimation and mean estimation on numerical vectors.
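To illustrate the idea of converting numerical mean estimation into frequency estimation, the following is a minimal sketch for a single scalar in [-1, 1]. It is not the Collision or CoCo mechanism from the paper: it assumes a textbook reduction (stochastic rounding to ±1 followed by binary randomized response), and all function names are hypothetical. The mean is recovered as the difference between the estimated frequencies of +1 and -1, which always sum to one and are therefore negatively correlated.

```python
import math
import random


def stochastic_round(x, rng):
    """Round x in [-1, 1] to +1 or -1 so that the expected output equals x."""
    return 1 if rng.random() < (1.0 + x) / 2.0 else -1


def randomized_response(sign, eps, rng):
    """Binary randomized response: keep the sign with prob. e^eps / (e^eps + 1)."""
    keep = math.exp(eps) / (math.exp(eps) + 1.0)
    return sign if rng.random() < keep else -sign


def estimate_mean(reports, eps):
    """Debias the observed frequency of +1, then return f(+1) - f(-1)."""
    n = len(reports)
    keep = math.exp(eps) / (math.exp(eps) + 1.0)
    noisy_plus = sum(1 for r in reports if r == 1) / n
    # E[noisy_plus] = keep * f_plus + (1 - keep) * (1 - f_plus)
    freq_plus = (noisy_plus - (1.0 - keep)) / (2.0 * keep - 1.0)
    freq_minus = 1.0 - freq_plus
    return freq_plus - freq_minus


def simulate(values, eps, seed=0):
    """Each user privatizes one value locally; the server estimates the mean."""
    rng = random.Random(seed)
    reports = [
        randomized_response(stochastic_round(v, rng), eps, rng) for v in values
    ]
    return estimate_mean(reports, eps)
```

For example, `simulate([0.5] * 20000, eps=1.0)` should return a value close to 0.5, with the error shrinking as the number of users or the privacy budget grows.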

research
12/07/2021

Locally Differentially Private Sparse Vector Aggregation

Vector mean estimation is a central primitive in federated analytics. In...
research
02/22/2023

Multi-Message Shuffled Privacy in Federated Learning

We study differentially private distributed optimization under communica...
research
01/10/2022

Optimal and Differentially Private Data Acquisition: Central and Local Mechanisms

We consider a platform's problem of collecting data from privacy sensiti...
research
10/18/2018

Locally Private Mean Estimation: Z-test and Tight Confidence Intervals

This work provides tight upper- and lower-bounds for the problem of mean...
research
10/22/2021

Tight and Robust Private Mean Estimation with Few Users

In this work, we study high-dimensional mean estimation under user-level...
research
10/31/2022

Local Differentially Private Frequency Estimation based on Learned Sketches

Sketches are widely used for frequency estimation of data with a large d...
research
08/14/2019

Aggregating Votes with Local Differential Privacy: Usefulness, Soundness vs. Indistinguishability

Voting plays a central role in bringing crowd wisdom to collective decis...
