Re-weighting of Vector-weighted Mechanisms for Utility Maximization under Differential Privacy

06/01/2020
by   Jingchen Hu, et al.
0

We implement a pseudo posterior synthesizer for microdata dissemination under two different vector-weighted schemes. Both schemes target high-risk records by exponentiating each of their likelihood contributions with a record-level weight, α_i ∈ [0,1] for record i ∈ (1,...,n). The first vector-weighted synthesizing mechanism computes the maximum (Lipschitz) bound, Δ_x_i, of each log-likelihood contribution over the space of parameter values, and sets the by-record weight α_i∝ 1 / Δ_x_i. The second vector-weighted synthesizer is based on constructing an identification disclosure risk probability, IR_i of record i, and setting the by-record weight α_i ∝ 1 / IR_i. We compute the overall Lipschitz bound, Δ_α,𝐱, for the database 𝐱, under each vector-weighted synthesizer such that both provide an (ϵ = 2 Δ_α,𝐱)-differential privacy (DP) formal guarantee. We propose a new vector re-weighting strategy that maximizes the data utility given any privacy budget for the vector-weighted synthesizers by adjusting the by-record weights, (α_i)_i = 1^n, such that their individual Lipschitz bounds, Δ_α,x_i, approach the bound for the entire database, Δ_α,𝐱. We illustrate our methods using simulated count data with and without over-dispersion-induced skewness and compare the results to a scalar-weighted synthesizer under the Exponential Mechanism (EM).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/25/2019

Bayesian Pseudo Posterior Mechanism under Differential Privacy

We propose a Bayesian pseudo posterior mechanism to generate record-leve...
research
05/10/2022

Mechanisms for Global Differential Privacy under Bayesian Data Synthesis

This paper introduces a new method that embeds any Bayesian model used t...
research
08/20/2019

Risk-Efficient Bayesian Data Synthesis for Privacy Protection

High-utility and low-risks synthetic data facilitates microdata dissemin...
research
01/19/2019

Bayesian Pseudo Posterior Synthesis for Data Privacy Protection

Statistical agencies utilize models to synthesize respondent-level data ...
research
01/15/2021

Private Tabular Survey Data Products through Synthetic Microdata Generation

We propose three synthetic microdata approaches to generate private tabu...
research
06/19/2023

Prior-itizing Privacy: A Bayesian Approach to Setting the Privacy Budget in Differential Privacy

When releasing outputs from confidential data, agencies need to balance ...

Please sign up or login with your details

Forgot password? Click here to reset