Optimizing Random Mixup with Gaussian Differential Privacy

02/14/2022
by   Donghao Li, et al.
0

Differentially private data release receives rising attention in machine learning community. Recently, an algorithm called DPMix is proposed to release high-dimensional data after a random mixup of degree m with differential privacy. However, limited theoretical justifications are given about the "sweet spot m" phenomenon, and directly applying DPMix to image data suffers from severe loss of utility. In this paper, we revisit random mixup with recent progress on differential privacy. In theory, equipped with Gaussian Differential Privacy with Poisson subsampling, a tight closed form analysis is presented that enables a quantitative characterization of optimal mixup m^* based on linear regression models. In practice, mixup of features, extracted by handcraft or pre-trained neural networks such as self-supervised learning without labels, is adopted to significantly boost the performance with privacy protection. We name it as Differentially Private Feature Mixup (DPFMix). Experiments on MNIST, CIFAR10/100 are conducted to demonstrate its remarkable utility improvement and protection against attacks.

READ FULL TEXT
research
10/11/2021

Generalization Techniques Empirically Outperform Differential Privacy against Membership Inference

Differentially private training algorithms provide protection against on...
research
05/02/2020

Differentially Private Generation of Small Images

We explore the training of generative adversarial networks with differen...
research
06/22/2020

Private Distributed Mean Estimation

Ever since its proposal, differential privacy has become the golden stan...
research
08/26/2022

Epistemic Parity: Reproducibility as an Evaluation Metric for Differential Privacy

Differential privacy mechanisms are increasingly used to enable public r...
research
05/12/2023

Impacts of Differential Privacy on Fostering more Racially and Ethnically Diverse Elementary Schools

In the face of increasingly severe privacy threats in the era of data an...
research
07/04/2018

Privacy Amplification by Subsampling: Tight Analyses via Couplings and Divergences

Differential privacy comes equipped with multiple analytical tools for t...
research
08/24/2017

Differentially Private Regression for Discrete-Time Survival Analysis

In survival analysis, regression models are used to understand the effec...

Please sign up or login with your details

Forgot password? Click here to reset