Simulation-based, Finite-sample Inference for Privatized Data

03/09/2023
by   Jordan Awan, et al.
0

Privacy protection methods, such as differentially private mechanisms, introduce noise into resulting statistics which often results in complex and intractable sampling distributions. In this paper, we propose to use the simulation-based "repro sample" approach to produce statistically valid confidence intervals and hypothesis tests based on privatized statistics. We show that this methodology is applicable to a wide variety of private inference problems, appropriately accounts for biases introduced by privacy mechanisms (such as by clamping), and improves over other state-of-the-art inference methods such as the parametric bootstrap in terms of the coverage and type I error of the private inference. We also develop significant improvements and extensions for the repro sample methodology for general models (not necessarily related to privacy), including 1) modifying the procedure to ensure guaranteed coverage and type I errors, even accounting for Monte Carlo error, and 2) proposing efficient numerical algorithms to implement the confidence intervals and p-values.

READ FULL TEXT

page 15

page 16

page 26

research
11/10/2017

Finite Sample Differentially Private Confidence Intervals

We study the problem of estimating finite sample confidence intervals of...
research
04/17/2020

Leveraging the Fisher randomization test using confidence distributions: inference, combination and fusion learning

The flexibility and wide applicability of the Fisher randomization test ...
research
10/12/2022

Differentially Private Bootstrap: New Privacy Analysis and Inference Strategies

Differential private (DP) mechanisms protect individual-level informatio...
research
06/14/2020

General-Purpose Differentially-Private Confidence Intervals

One of the most common statistical goals is to estimate a population par...
research
06/18/2021

Non-parametric Differentially Private Confidence Intervals for the Median

Differential privacy is a restriction on data processing algorithms that...
research
03/31/2019

Differentially Private Inference for Binomial Data

We derive uniformly most powerful (UMP) tests for simple and one-sided h...
research
01/06/2023

Rank-transformed subsampling: inference for multiple data splitting and exchangeable p-values

Many testing problems are readily amenable to randomised tests such as t...

Please sign up or login with your details

Forgot password? Click here to reset