Confidentiality and linked data

07/15/2019
by Felix Ritchie, et al.

Data providers such as government statistical agencies perform a balancing act: maximising the information published to inform decision-making and research, while simultaneously protecting privacy. The emergence of identified administrative datasets that can be shared (and thus linked) offers huge potential benefits but also brings significant additional risks. This article introduces the principles and methods of linking data across different sources and points in time, focusing on potential areas of risk. We then consider confidentiality risk, focusing in particular on the "intruder" problem central to the area, and looking at risks both from data producer outputs and from the release of micro-data for further analysis. Finally, we briefly consider potential solutions to micro-data release, both the statistical solutions considered in other contributed articles and non-statistical solutions.
