Private False Discovery Rate Control

11/12/2015
by   Cynthia Dwork, et al.
0

We provide the first differentially private algorithms for controlling the false discovery rate (FDR) in multiple hypothesis testing, with essentially no loss in power under certain conditions. Our general approach is to adapt a well-known variant of the Benjamini-Hochberg procedure (BHq), making each step differentially private. This destroys the classical proof of FDR control. To prove FDR control of our method, (a) we develop a new proof of the original (non-private) BHq algorithm and its robust variants -- a proof requiring only the assumption that the true null test statistics are independent, allowing for arbitrary correlations between the true nulls and false nulls. This assumption is fairly weak compared to those previously shown in the vast literature on this topic, and explains in part the empirical robustness of BHq. Then (b) we relate the FDR control properties of the differentially private version to the control properties of the non-private version. We also present a low-distortion "one-shot" differentially private primitive for "top k" problems, e.g., "Which are the k most popular hobbies?" (which we apply to: "Which hypotheses have the k most significant p-values?"), and use it to get a faster privacy-preserving instantiation of our general approach at little cost in accuracy. The proof of privacy for the one-shot top k algorithm introduces a new technique of independent interest.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/11/2018

Differentially Private False Discovery Rate Control

Differential privacy provides a rigorous framework for privacy-preservin...
research
05/31/2023

Adaptive False Discovery Rate Control with Privacy Guarantee

Differentially private multiple testing procedures can protect the infor...
research
05/18/2021

Oneshot Differentially Private Top-k Selection

Being able to efficiently and accurately select the top-k elements witho...
research
11/03/2017

Differentially Private ANOVA Testing

Modern society generates an incredible amount of data about individuals,...
research
08/14/2022

Differentially Private Hypothesis Testing with the Subsampled and Aggregated Randomized Response Mechanism

Randomized response is one of the oldest and most well-known methods for...
research
03/24/2023

Differentially Private Synthetic Control

Synthetic control is a causal inference tool used to estimate the treatm...
research
09/05/2018

A Differentially Private Wilcoxon Signed-Rank Test

Hypothesis tests are a crucial statistical tool for data mining and are ...

Please sign up or login with your details

Forgot password? Click here to reset