On Rényi Differential Privacy in Statistics-Based Synthetic Data Generation

03/31/2023
by   Takayuki Miura, et al.

Privacy protection with synthetic data generation often uses differentially private statistics and model parameters to quantitatively express theoretical security. However, these methods do not take into account the privacy protection provided by the randomness of data generation itself. In this paper, we theoretically evaluate the Rényi differential privacy arising from the randomness in data generation for a synthetic data generation method that uses the mean vector and the covariance matrix of an original dataset. Specifically, for a fixed α > 1, we derive conditions on ε under which the synthetic data generation satisfies (α, ε)-Rényi differential privacy, under a bounded neighboring condition and an unbounded neighboring condition, respectively. In particular, under the unbounded condition, when the original dataset and the synthetic dataset each have size 10 million, the mechanism satisfies (4, 0.576)-Rényi differential privacy. We also show that, when translated into traditional (ε, δ)-differential privacy, the mechanism satisfies (4.00, 10^-10)-differential privacy.
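
The kind of mechanism the abstract describes can be illustrated with a minimal sketch. It assumes that synthetic records are drawn from a multivariate normal distribution parameterized by the sample mean and sample covariance of the original data; the abstract states only that the mean vector and covariance matrix are used, so the Gaussian sampler, the covariance normalization, and the name `generate_synthetic` are assumptions made for illustration, not the authors' implementation.

```python
import numpy as np


def generate_synthetic(original, n_synthetic, seed=None):
    """Draw synthetic rows from N(mean, cov) fitted to `original`.

    Hypothetical helper: the Gaussian sampler and the (n - 1)
    covariance normalization are assumptions, not taken from the paper.
    """
    rng = np.random.default_rng(seed)
    mean = original.mean(axis=0)          # mean vector, shape (d,)
    cov = np.cov(original, rowvar=False)  # covariance matrix, shape (d, d)
    # No noise is added to mean or cov: the only randomness is the sampling.
    return rng.multivariate_normal(mean, cov, size=n_synthetic)


# Toy stand-in for an original dataset: 1000 records, 3 attributes.
data_rng = np.random.default_rng(0)
original = data_rng.normal(size=(1000, 3))

synthetic = generate_synthetic(original, n_synthetic=1000, seed=1)
print(synthetic.shape)
```

In this sketch the statistics themselves are not perturbed; the sampling step is the sole source of randomness, which is the source of privacy protection that the paper sets out to quantify.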


research
09/30/2021

Private sampling: a noiseless approach for generating differentially private synthetic data

In a world where artificial intelligence and data science become omnipre...
research
05/23/2018

pMSE Mechanism: Differentially Private Synthetic Data with Maximal Distributional Similarity

We propose a method for the release of differentially private synthetic ...
research
06/09/2021

Prior-Aware Distribution Estimation for Differential Privacy

Joint distribution estimation of a dataset under differential privacy is...
research
10/04/2017

Differentially Private Database Release via Kernel Mean Embeddings

We lay theoretical foundations for new database release mechanisms that ...
research
03/11/2018

A simple algorithm for estimating distribution parameters from n-dimensional randomized binary responses

Randomized response for privacy protection is attractive as provided dis...
research
06/22/2022

Optimal Local Bayesian Differential Privacy over Markov Chains

In the literature of data privacy, differential privacy is the most popu...
research
04/29/2021

On Linear Time Decidability of Differential Privacy for Programs with Unbounded Inputs

We introduce an automata model for describing interesting classes of dif...
