Private and Collaborative Kaplan-Meier Estimators

05/24/2023
by   Shadi Rahimian, et al.
0

Kaplan-Meier estimators capture the survival behavior of a cohort. They are one of the key statistics in survival analysis. As with any estimator, they become more accurate in presence of larger datasets. This motivates multiple data holders to share their data in order to calculate a more accurate Kaplan-Meier estimator. However, these survival datasets often contain sensitive information of individuals and it is the responsibility of the data holders to protect their data, thus a naive sharing of data is often not viable. In this work, we propose two novel differentially private schemes that are facilitated by our novel synthetic dataset generation method. Based on these scheme we propose various paths that allow a joint estimation of the Kaplan-Meier curves with strict privacy guarantees. Our contribution includes a taxonomy of methods for this task and an extensive experimental exploration and evaluation based on this structure. We show that we can construct a joint, global Kaplan-Meier estimator which satisfies very tight privacy guarantees and with no statistically-significant utility loss compared to the non-private centralized setting.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/04/2019

Differentially Private Survival Function Estimation

Survival function estimation is used in many disciplines, but it is most...
research
08/09/2023

Collaborative Learning From Distributed Data With Differentially Private Synthetic Twin Data

Consider a setting where multiple parties holding sensitive data aim to ...
research
07/02/2017

Privacy-Preserving Mechanisms for Parametric Survival Analysis with Weibull Distribution

Survival analysis studies the statistical properties of the time until a...
research
08/24/2017

Differentially Private Regression for Discrete-Time Survival Analysis

In survival analysis, regression models are used to understand the effec...
research
05/28/2022

MC-GEN:Multi-level Clustering for Private Synthetic Data Generation

Nowadays, machine learning is one of the most common technology to turn ...
research
10/28/2019

Improved Differentially Private Decentralized Source Separation for fMRI Data

Blind source separation algorithms such as independent component analysi...
research
06/02/2022

Impact of Sampling on Locally Differentially Private Data Collection

With the recent bloom of data, there is a huge surge in threats against ...

Please sign up or login with your details

Forgot password? Click here to reset