Flexible and scalable privacy assessment for very large datasets, with an application to official governmental microdata

04/28/2022
by Mário S. Alvim, et al.

We present a systematic refactoring of the conventional treatment of privacy analyses, basing it on mathematical concepts from the framework of Quantitative Information Flow (QIF). The approach we suggest brings three principal advantages: it is flexible, allowing for precise quantification and comparison of privacy risks for attacks both known and novel; it can be computationally tractable for very large, longitudinal datasets; and its results are explainable both to politicians and to the general public. We apply our approach to a very large case study: the Educational Censuses of Brazil, curated by the governmental agency INEP, which comprise over 90 attributes of approximately 50 million individuals released longitudinally every year since 2007. These datasets have only very recently (2018-2021) attracted legislation to regulate their privacy – while at the same time continuing to maintain the openness that had been sought in Brazilian society. INEP's reaction to that legislation was the genesis of our project with them. In our conclusions here we share the scientific, technical, and communication lessons we learned in the process.

research
09/22/2019

From Data Disclosure to Privacy Nudges: A Privacy-aware and User-centric Personal Data Management Framework

Although there are privacy-enhancing tools designed to protect users' on...
research
12/10/2020

Virtual Classrooms and Real Harms

Universities have been forced to rely on remote educational technology t...
research
04/02/2019

Data Disclosure under Perfect Sample Privacy

Perfect data privacy seems to be in fundamental opposition to the econom...
research
10/24/2017

Exploratory Study of the Privacy Extension for System Theoretic Process Analysis (STPA-Priv) to elicit Privacy Risks in eHealth

Context: System Theoretic Process Analysis for Privacy (STPA-Priv) is a ...
research
09/19/2022

A Framework for Preserving Privacy and Cybersecurity in Brain-Computer Interfacing Applications

Brain-Computer Interfaces (BCIs) comprise a rapidly evolving field of te...
research
05/13/2019

Smartwatch games: Encouraging privacy-protective behaviour in a longitudinal study

While the public claim concern for their privacy, they frequently appear...
