Privacy Amplification via Compression: Achieving the Optimal Privacy-Accuracy-Communication Trade-off in Distributed Mean Estimation

04/04/2023
by   Wei-Ning Chen, et al.
0

Privacy and communication constraints are two major bottlenecks in federated learning (FL) and analytics (FA). We study the optimal accuracy of mean and frequency estimation (canonical models for FL and FA respectively) under joint communication and (ε, δ)-differential privacy (DP) constraints. We show that in order to achieve the optimal error under (ε, δ)-DP, it is sufficient for each client to send Θ( n min(ε, ε^2)) bits for FL and Θ(log( nmin(ε, ε^2) )) bits for FA to the server, where n is the number of participating clients. Without compression, each client needs O(d) bits and log d bits for the mean and frequency estimation problems respectively (where d corresponds to the number of trainable parameters in FL or the domain size in FA), which means that we can get significant savings in the regime n min(ε, ε^2) = o(d), which is often the relevant regime in practice. Our algorithms leverage compression for privacy amplification: when each client communicates only partial information about its sample, we show that privacy can be amplified by randomly selecting the part contributed by each client.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/22/2023

Communication-Efficient Federated Learning through Importance Sampling

The high communication cost of sending model updates from the clients to...
research
07/05/2021

Optimizing the Numbers of Queries and Replies in Federated Learning with Differential Privacy

Federated learning (FL) empowers distributed clients to collaboratively ...
research
08/17/2020

Shuffled Model of Federated Learning: Privacy, Communication and Accuracy Trade-offs

We consider a distributed empirical risk minimization (ERM) optimization...
research
03/07/2022

The Fundamental Price of Secure Aggregation in Differentially Private Federated Learning

We consider the problem of training a d dimensional model with distribut...
research
07/22/2020

Breaking the Communication-Privacy-Accuracy Trilemma

Two major challenges in distributed learning and estimation are 1) prese...
research
06/08/2023

Exact Optimality of Communication-Privacy-Utility Tradeoffs in Distributed Mean Estimation

We study the mean estimation problem under communication and local diffe...
research
06/15/2023

Private Federated Frequency Estimation: Adapting to the Hardness of the Instance

In federated frequency estimation (FFE), multiple clients work together ...

Please sign up or login with your details

Forgot password? Click here to reset