Differentially Private Data Generation Needs Better Features

05/25/2022
by   Fredrik Harder, et al.
0

Training even moderately-sized generative models with differentially-private stochastic gradient descent (DP-SGD) is difficult: the required level of noise for reasonable levels of privacy is simply too large. We advocate instead building off a good, relevant representation on public data, then using private data only for "transfer learning." In particular, we minimize the maximum mean discrepancy (MMD) between private target data and the generated distribution, using a kernel based on perceptual features from a public dataset. With the MMD, we can simply privatize the data-dependent term once and for all, rather than introducing noise at each step of optimization as in DP-SGD. Our algorithm allows us to generate CIFAR10-level images faithfully with ε≈ 2, far surpassing the current state of the art, which only models MNIST and FashionMNIST at ε≈ 10. Our work introduces simple yet powerful foundations for reducing the gap between private and non-private deep generative models.

READ FULL TEXT
research
10/18/2022

Differentially Private Diffusion Models

While modern machine learning models rely on increasingly large training...
research
06/08/2021

PEARL: Data Synthesis via Private Embeddings and Adversarial Reconstruction Learning

We propose a new framework of synthesizing data using deep generative mo...
research
07/09/2021

Differentially private training of neural networks with Langevin dynamics for calibrated predictive uncertainty

We show that differentially private stochastic gradient descent (DP-SGD)...
research
02/26/2020

Differentially Private Mean Embeddings with Random Features (DP-MERF) for Simple Practical Synthetic Data Generation

We present a differentially private data generation paradigm using rando...
research
06/09/2021

Polynomial magic! Hermite polynomials for private data generation

Kernel mean embedding is a useful tool to compare probability measures. ...
research
11/17/2021

Network Generation with Differential Privacy

We consider the problem of generating private synthetic versions of real...
research
05/10/2021

Differentially Private Transfer Learning with Conditionally Deep Autoencoders

This paper considers the problem of differentially private semi-supervis...

Please sign up or login with your details

Forgot password? Click here to reset