Understanding how Differentially Private Generative Models Spend their Privacy Budget

05/18/2023
by   Georgi Ganev, et al.
0

Generative models trained with Differential Privacy (DP) are increasingly used to produce synthetic data while reducing privacy risks. Navigating their specific privacy-utility tradeoffs makes it challenging to determine which models would work best for specific settings/tasks. In this paper, we fill this gap in the context of tabular data by analyzing how DP generative models distribute privacy budgets across rows and columns, arguably the main source of utility degradation. We examine the main factors contributing to how privacy budgets are spent, including underlying modeling techniques, DP mechanisms, and data dimensionality. Our extensive evaluation of both graphical and deep generative models sheds light on the distinctive features that render them suitable for different settings and tasks. We show that graphical models distribute the privacy budget horizontally and thus cannot handle relatively wide datasets while the performance on the task they were optimized for monotonically increases with more data. Deep generative models spend their budget per iteration, so their behavior is less predictable with varying dataset dimensions but could perform better if trained on more features. Also, low levels of privacy (ϵ≥100) could help some models generalize, achieving better results than without applying DP.

READ FULL TEXT
research
10/18/2022

Differentially Private Diffusion Models

While modern machine learning models rely on increasingly large training...
research
01/05/2018

Differentially Private Releasing via Deep Generative Model

Privacy-preserving releasing of complex data (e.g., image, text, audio) ...
research
09/23/2021

Robin Hood and Matthew Effects – Differential Privacy Has Disparate Impact on Synthetic Data

Generative models trained using Differential Privacy (DP) are increasing...
research
04/27/2022

Spending Privacy Budget Fairly and Wisely

Differentially private (DP) synthetic data generation is a practical met...
research
12/20/2022

Local Differential Privacy Image Generation Using Flow-based Deep Generative Models

Diagnostic radiologists need artificial intelligence (AI) for medical im...
research
05/24/2023

Can Copyright be Reduced to Privacy?

There is an increasing concern that generative AI models may produce out...
research
05/30/2023

How Generative Models Improve LOS Estimation in 6G Non-Terrestrial Networks

With the advent of 5G and the anticipated arrival of 6G, there has been ...

Please sign up or login with your details

Forgot password? Click here to reset