Private Post-GAN Boosting

07/23/2020
by   Marcel Neunhoeffer, et al.
16

Differentially private GANs have proven to be a promising approach for generating realistic synthetic data without compromising the privacy of individuals. However, due to the privacy-protective noise introduced in the training, the convergence of GANs becomes even more elusive, which often leads to poor utility in the output generator at the end of training. We propose Private post-GAN boosting (Private PGB), a differentially private method that combines samples produced by the sequence of generators obtained during GAN training to create a high-quality synthetic dataset. Our method leverages the Private Multiplicative Weights method (Hardt and Rothblum, 2010) and the discriminator rejection sampling technique (Azadi et al., 2019) for reweighting generated samples, to obtain high quality synthetic data even in cases where GAN training does not converge. We evaluate Private PGB on a Gaussian mixture dataset and two US Census datasets, and demonstrate that Private PGB improves upon the standard private GAN approach across a collection of quality measures. Finally, we provide a non-private variant of PGB that improves the data quality of standard GAN training.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/06/2023

Private GANs, Revisited

We show that the canonical approach for training differentially private ...
research
03/08/2018

Generating Differentially Private Datasets Using GANs

In this paper, we present a technique for generating artificial datasets...
research
01/10/2022

Differentially Private Generative Adversarial Networks with Model Inversion

To protect sensitive data in training a Generative Adversarial Network (...
research
09/29/2020

imdpGAN: Generating Private and Specific Data with Generative Adversarial Networks

Generative Adversarial Network (GAN) and its variants have shown promisi...
research
04/06/2020

Leveraging GANs to Improve Continuous Path Keyboard Input Models

Continuous path keyboard input has higher inherent ambiguity than standa...
research
08/10/2023

The Fast and the Private: Task-based Dataset Search

Modern dataset search platforms employ ML task-based utility metrics ins...
research
10/22/2020

DPD-InfoGAN: Differentially Private Distributed InfoGAN

Generative Adversarial Networks (GANs) are deep learning architectures c...

Please sign up or login with your details

Forgot password? Click here to reset