The communication cost of security and privacy in federated frequency estimation

11/18/2022
by   Wei-Ning Chen, et al.
0

We consider the federated frequency estimation problem, where each user holds a private item X_i from a size-d domain and a server aims to estimate the empirical frequency (i.e., histogram) of n items with n ≪ d. Without any security and privacy considerations, each user can communicate its item to the server by using log d bits. A naive application of secure aggregation protocols would, however, require dlog n bits per user. Can we reduce the communication needed for secure aggregation, and does security come with a fundamental cost in communication? In this paper, we develop an information-theoretic model for secure aggregation that allows us to characterize the fundamental cost of security and privacy in terms of communication. We show that with security (and without privacy) Ω( n log d ) bits per user are necessary and sufficient to allow the server to compute the frequency distribution. This is significantly smaller than the dlog n bits per user needed by the naive scheme, but significantly higher than the log d bits per user needed without security. To achieve differential privacy, we construct a linear scheme based on a noisy sketch which locally perturbs the data and does not require a trusted server (a.k.a. distributed differential privacy). We analyze this scheme under ℓ_2 and ℓ_∞ loss. By using our information-theoretic framework, we show that the scheme achieves the optimal accuracy-privacy trade-off with optimal communication cost, while matching the performance in the centralized case where data is stored in the central server.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/24/2022

SwiftAgg+: Achieving Asymptotically Optimal Communication Loads in Secure Aggregation for Federated Learning

We propose SwiftAgg+, a novel secure aggregation protocol for federated ...
research
03/07/2022

The Fundamental Price of Secure Aggregation in Differentially Private Federated Learning

We consider the problem of training a d dimensional model with distribut...
research
08/17/2020

Information-Theoretic Privacy in Federated Submodel learning

We consider information-theoretic privacy in federated submodel learning...
research
06/15/2023

Private Federated Frequency Estimation: Adapting to the Hardness of the Instance

In federated frequency estimation (FFE), multiple clients work together ...
research
02/12/2021

The Distributed Discrete Gaussian Mechanism for Federated Learning with Secure Aggregation

We consider training models on private data that is distributed across u...
research
06/08/2021

Private Multi-Group Aggregation

We study the differentially private multi group aggregation (PMGA) probl...
research
08/11/2020

Security Versus Privacy

Linear queries can be submitted to a server containing private data. The...

Please sign up or login with your details

Forgot password? Click here to reset