Health Data in an Open World

12/15/2017
by   Chris Culnane, et al.
0

With the aim of informing sound policy about data sharing and privacy, we describe successful re-identification of patients in an Australian de-identified open health dataset. As in prior studies of similar datasets, a few mundane facts often suffice to isolate an individual. Some people can be identified by name based on publicly available information. Decreasing the precision of the unit-record level data, or perturbing it statistically, makes re-identification gradually harder at a substantial cost to utility. We also examine the value of related datasets in improving the accuracy and confidence of re-identification. Our re-identifications were performed on a 10 dataset, but a related open Australian dataset allows us to infer with high confidence that some individuals in the sample have been correctly re-identified. Finally, we examine the combination of the open datasets with some commercial datasets that are known to exist but are not in our possession. We show that they would further increase the ease of re-identification.

READ FULL TEXT
research
06/10/2016

De-identification of Patient Notes with Recurrent Neural Networks

Objective: Patient notes in electronic health records (EHRs) may contain...
research
01/21/2022

AI-based Re-identification of Behavioral Clickstream Data

AI-based face recognition, i.e., the re-identification of individuals wi...
research
06/02/2016

Mobile phone data for public health: towards data-sharing solutions that protect individual privacy and national security

We outline the constraints faced by operators when deciding to share de-...
research
01/27/2019

Automatic end-to-end De-identification: Is high accuracy the only metric?

De-identification of electronic health records (EHR) is a vital step tow...
research
05/18/2023

In the Name of Fairness: Assessing the Bias in Clinical Record De-identification

Data sharing is crucial for open science and reproducible research, but ...
research
07/31/2023

A Trajectory K-Anonymity Model Based on Point Density and Partition

As people's daily life becomes increasingly inseparable from various mob...
research
06/12/2018

Is India's Unique Identification Number a legally valid identification?

A legally valid identification document allows impartial arbitration of ...

Please sign up or login with your details

Forgot password? Click here to reset