Iterative Methods for Private Synthetic Data: Unifying Framework and New Methods

06/14/2021
by   Terrance Liu, et al.

We study private synthetic data generation for query release, where the goal is to construct a sanitized version of a sensitive dataset, subject to differential privacy, that approximately preserves the answers to a large collection of statistical queries. We first present an algorithmic framework that unifies a long line of iterative algorithms in the literature. Under this framework, we propose two new methods. The first method, private entropy projection (PEP), can be viewed as an advanced variant of MWEM that adaptively reuses past query measurements to boost accuracy. Our second method, generative networks with the exponential mechanism (GEM), circumvents computational bottlenecks in algorithms such as MWEM and PEP by optimizing over generative models parameterized by neural networks, which capture a rich family of distributions while enabling fast gradient-based optimization. We demonstrate that PEP and GEM empirically outperform existing algorithms. Furthermore, we show that GEM readily incorporates prior information from public data while overcoming limitations of PMW^Pub, the existing state-of-the-art method that also leverages public data.
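For readers unfamiliar with the iterative algorithms the abstract refers to, the following is a minimal sketch of an MWEM-style loop: select a poorly answered query with the exponential mechanism, measure it with Laplace noise, then update the synthetic distribution by multiplicative weights. It is not the authors' code, and it does not implement PEP or GEM. The function name `mwem_sketch`, its parameters, and the even split of each round's budget between selection and measurement are assumptions made for illustration.

```python
import numpy as np


def mwem_sketch(true_dist, queries, n_records, epsilon, rounds, rng):
    """Illustrative MWEM-style loop (hypothetical helper, not from the paper).

    true_dist: shape (N,), normalized histogram of the sensitive data.
    queries:   shape (m, N), linear queries with entries in [0, 1].
    n_records: number of records; a normalized query answer has sensitivity 1/n.
    """
    domain_size = true_dist.shape[0]
    synth = np.full(domain_size, 1.0 / domain_size)  # start from uniform
    true_answers = queries @ true_dist
    eps_round = epsilon / rounds  # assumed: budget divided evenly over rounds

    for _ in range(rounds):
        # Select: exponential mechanism with half the round budget,
        # scoring each query by its current absolute error.
        errors = np.abs(queries @ synth - true_answers)
        scores = (eps_round / 2) * n_records * errors / 2
        probs = np.exp(scores - scores.max())  # shift for numerical stability
        probs /= probs.sum()
        idx = rng.choice(len(queries), p=probs)

        # Measure: Laplace mechanism with the other half of the round budget.
        noisy = true_answers[idx] + rng.laplace(scale=2.0 / (n_records * eps_round))

        # Update: multiplicative weights step toward the noisy measurement.
        q = queries[idx]
        synth = synth * np.exp(q * (noisy - q @ synth) / 2)
        synth /= synth.sum()

    return synth


# Toy usage on random 0/1 queries over a domain of size 16.
rng = np.random.default_rng(0)
data_dist = rng.dirichlet(np.ones(16))
toy_queries = rng.integers(0, 2, size=(20, 16)).astype(float)
synthetic = mwem_sketch(data_dist, toy_queries, n_records=1000,
                        epsilon=1.0, rounds=10, rng=rng)
```

The sketch works on an explicit histogram over the whole data domain, so its cost grows with the domain size. This is the kind of computational bottleneck that, per the abstract, GEM avoids by optimizing over generative models instead.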

research
07/10/2020

New Oracle-Efficient Algorithms for Private Synthetic Data Release

We present three new algorithms for constructing differentially private ...
research
03/11/2021

Differentially Private Query Release Through Adaptive Projection

We propose, implement, and evaluate a new algorithm for releasing answer...
research
06/13/2022

Private Synthetic Data with Hierarchical Structure

We study the problem of differentially private synthetic data generation...
research
12/30/2020

PrivSyn: Differentially Private Data Synthesis

In differential privacy (DP), a challenging problem is to generate synth...
research
09/15/2023

DP-PQD: Privately Detecting Per-Query Gaps In Synthetic Data Generated By Black-Box Mechanisms

Synthetic data generation methods, and in particular, private synthetic ...
research
02/17/2021

Leveraging Public Data for Practical Private Query Release

In many statistical problems, incorporating priors can significantly imp...
research
06/05/2023

Generating Private Synthetic Data with Genetic Algorithms

We study the problem of efficiently generating differentially private sy...
