Private Protocols for U-Statistics in the Local Model and Beyond

10/09/2019
by   James Bell, et al.
0

In this paper, we study the problem of computing U-statistics of degree 2, i.e., quantities that come in the form of averages over pairs of data points, in the local model of differential privacy (LDP). The class of U-statistics covers many statistical estimates of interest, including Gini mean difference, Kendall's tau coefficient and Area under the ROC Curve (AUC), as well as empirical risk measures for machine learning problems such as ranking, clustering and metric learning. We first introduce an LDP protocol based on quantizing the data into bins and applying randomized response, which guarantees an ϵ-LDP estimate with a Mean Squared Error (MSE) of O(1/√(n)ϵ) under regularity assumptions on the U-statistic or the data distribution. We then propose a specialized protocol for AUC based on a novel use of hierarchical histograms that achieves MSE of O(α^3/nϵ^2) for arbitrary data distribution. We also show that 2-party secure computation allows to design a protocol with MSE of O(1/nϵ^2), without any assumption on the kernel function or data distribution and with total communication linear in the number of users n. Finally, we evaluate the performance of our protocols through experiments on synthetic and real datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/22/2022

One-Shot Federated Learning for Model Clustering and Learning in Heterogeneous Environments

We propose a communication efficient approach for federated learning in ...
research
02/03/2020

Private Summation in the Multi-Message Shuffle Model

The shuffle model of differential privacy (Erlingsson et al. SODA 2019; ...
research
04/05/2021

Frequency Estimation Under Multiparty Differential Privacy: One-shot and Streaming

We study the fundamental problem of frequency estimation under both priv...
research
08/30/2020

Data Sanitisation Protocols for the Privacy Funnel with Differential Privacy Guarantees

In the Open Data approach, governments and other public organisations wa...
research
02/17/2021

ppAUC: Privacy Preserving Area Under the Curve with Secure 3-Party Computation

Computing an AUC as a performance measure to compare the quality of diff...
research
06/07/2023

A note on the optimum allocation of resources to follow up unit nonrespondents in probability

Common practice to address nonresponse in probability surveys in Nationa...
research
05/13/2023

A note on bounded distance-based information loss metrics for statistical disclosure control of numeric microdata

In the field of statistical disclosure control, the tradeoff between dat...

Please sign up or login with your details

Forgot password? Click here to reset