Analyzing Privacy Leakage in Machine Learning via Multiple Hypothesis Testing: A Lesson From Fano

10/24/2022
by   Chuan Guo, et al.
0

Differential privacy (DP) is by far the most widely accepted framework for mitigating privacy risks in machine learning. However, exactly how small the privacy parameter ϵ needs to be to protect against certain privacy risks in practice is still not well-understood. In this work, we study data reconstruction attacks for discrete data and analyze it under the framework of multiple hypothesis testing. We utilize different variants of the celebrated Fano's inequality to derive upper bounds on the inferential power of a data reconstruction adversary when the model is trained differentially privately. Importantly, we show that if the underlying private data takes values from a set of size M, then the target privacy parameter ϵ can be O(log M) before the adversary gains significant inferential power. Our analysis offers theoretical evidence for the empirical effectiveness of DP against data reconstruction attacks even at relatively large values of ϵ.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/08/2023

Bounding data reconstruction attacks with the hypothesis testing interpretation of differential privacy

We explore Reconstruction Robustness (ReRo), which was recently proposed...
research
01/28/2022

Bounding Training Data Reconstruction in Private (Deep) Learning

Differential privacy is widely accepted as the de facto method for preve...
research
10/24/2022

Generalised Likelihood Ratio Testing Adversaries through the Differential Privacy Lens

Differential Privacy (DP) provides tight upper bounds on the capabilitie...
research
11/19/2022

A Survey on Differential Privacy with Machine Learning and Future Outlook

Nowadays, machine learning models and applications have become increasin...
research
03/29/2023

Non-Asymptotic Lower Bounds For Training Data Reconstruction

We investigate semantic guarantees of private learning algorithms for th...
research
06/09/2022

Log-Concave and Multivariate Canonical Noise Distributions for Differential Privacy

A canonical noise distribution (CND) is an additive mechanism designed t...
research
11/06/2022

Confidence-Ranked Reconstruction of Census Microdata from Published Statistics

A reconstruction attack on a private dataset D takes as input some publi...

Please sign up or login with your details

Forgot password? Click here to reset