A Differentially Private Bayesian Approach to Replication Analysis

11/26/2021
by   Chengxin Yang, et al.
0

Replication analysis is widely used in many fields of study. Once a research is published, many other researchers will conduct the same or very similar analysis to confirm the reliability of the published research. However, what if the data is confidential? In particular, if the data sets used for the studies are confidential, we cannot release the results of replication analyses to any entity without the permission to access the data sets, otherwise it may result in serious privacy leakage especially when the published study and replication studies are using similar or common data sets. For example, examining the influence of the treatment on outliers can cause serious leakage of the information about outliers. In this paper, we build two frameworks for replication analysis by a differentially private Bayesian approach. We formalize our questions of interest and illustrates the properties of our methods by a combination of theoretical analysis and simulation to show the feasibility of our approach. We also provide some guidance on the choice of parameters and interpretation of the results.

READ FULL TEXT

page 19

page 21

page 24

page 26

research
11/11/2022

Differentially Private Methods for Compositional Data

Protecting individuals' private information while still allowing modeler...
research
10/04/2019

Differentially Private Survival Function Estimation

Survival function estimation is used in many disciplines, but it is most...
research
05/19/2022

Differentially Private Linear Sketches: Efficient Implementations and Applications

Linear sketches have been widely adopted to process fast data streams, a...
research
08/08/2023

Accurate, Explainable, and Private Models: Providing Recourse While Minimizing Training Data Leakage

Machine learning models are increasingly utilized across impactful domai...
research
08/09/2023

Collaborative Learning From Distributed Data With Differentially Private Synthetic Twin Data

Consider a setting where multiple parties holding sensitive data aim to ...
research
08/23/2019

You Can't Publish Replication Studies (and How to Anyways)

Reproducibility has been increasingly encouraged by communities of scien...

Please sign up or login with your details

Forgot password? Click here to reset