Releasing survey microdata with exact cluster locations and additional privacy safeguards

05/24/2022
by Till Koebe, et al.

Household survey programs around the world publish fine-grained georeferenced microdata to support research on the interdependence of human livelihoods and their surrounding environment. To safeguard the respondents' privacy, micro-level survey data is usually (pseudo-)anonymized through deletion or perturbation procedures, such as obfuscating the true location of data collection. This, however, poses a challenge to emerging approaches that augment survey data with auxiliary information at the local level. Here, we propose an alternative microdata dissemination strategy that retains the utility of the original microdata while adding privacy safeguards through synthetic data produced by generative models. We back our proposal with experiments using data from the 2011 Costa Rican census and satellite-derived auxiliary information. Our strategy reduces the respondents' re-identification risk for any number of disclosed attributes by 60-80%, even under re-identification attempts.

research
01/22/2019

Perturbation Privacy for Sensitive Locations in Transit Data Publication: A Case Study of Montreal Trajet Surveys

Smartphone based travel data collection has become an important tool for...
research
03/26/2019

Privacy of trajectory micro-data : a survey

We survey the literature on the privacy of trajectory micro-data, i.e., ...
research
08/19/2022

Synthetic Data in Human Analysis: A Survey

Deep neural networks have become prevalent in human analysis, boosting t...
research
06/03/2016

Using Neural Generative Models to Release Synthetic Twitter Corpora with Reduced Stylometric Identifiability of Users

We present a method for generating synthetic versions of Twitter data us...
research
01/15/2021

Private Tabular Survey Data Products through Synthetic Microdata Generation

We propose three synthetic microdata approaches to generate private tabu...
research
07/26/2023

Online Context-aware Data Release with Sequence Information Privacy

Publishing streaming data in a privacy-preserving manner has been a key ...