Unbiased Statistical Estimation and Valid Confidence Intervals Under Differential Privacy

10/27/2021
by   Christian Covington, et al.
0

We present a method for producing unbiased parameter estimates and valid confidence intervals under the constraints of differential privacy, a formal framework for limiting individual information leakage from sensitive data. Prior work in this area is limited in that it is tailored to calculating confidence intervals for specific statistical procedures, such as mean estimation or simple linear regression. While other recent work can produce confidence intervals for more general sets of procedures, they either yield only approximately unbiased estimates, are designed for one-dimensional outputs, or assume significant user knowledge about the data-generating distribution. Our method induces distributions of mean and covariance estimates via the bag of little bootstraps (BLB) and uses them to privately estimate the parameters' sampling distribution via a generalized version of the CoinPress estimation algorithm. If the user can bound the parameters of the BLB-induced parameters and provide heavier-tailed families, the algorithm produces unbiased parameter estimates and valid confidence intervals which hold with arbitrarily high probability. These results hold in high dimensions and for any estimation procedure which behaves nicely under the bootstrap.

READ FULL TEXT

page 19

page 20

page 22

page 24

research
03/09/2018

Exceedance probability for parameter estimates

Many researchers and statisticians are conflicted over the practice of h...
research
01/10/2018

Generalized Linear Models with Linear Constraints for Microbiome Compositional Data

Motivated by regression analysis for microbiome compositional data, this...
research
08/21/2021

Statistical Quantification of Differential Privacy: A Local Approach

In this work we introduce a new approach for statistical quantification ...
research
06/21/2019

Guaranteed Validity for Empirical Approaches to Adaptive Data Analysis

We design a general framework for answering adaptive statistical queries...
research
11/19/2016

A Bayesian approach to type-specific conic fitting

A perturbative approach is used to quantify the effect of noise in data ...
research
06/27/2022

Network resampling for estimating uncertainty

With network data becoming ubiquitous in many applications, many models ...
research
12/31/2021

Two-Stage and Sequential Unbiased Estimation of N in Binomial Trials, when the Probability of Success p is Unknown

We propose two-stage and sequential procedures to construct prescribed p...

Please sign up or login with your details

Forgot password? Click here to reset