Advancing Microdata Privacy Protection: A Review of Synthetic Data

08/01/2023
by   Jingchen Hu, et al.
0

Synthetic data generation is a powerful tool for privacy protection when considering public release of record-level data files. Initially proposed about three decades ago, it has generated significant research and application interest. To meet the pressing demand of data privacy protection in a variety of contexts, the field needs more researchers and practitioners. This review provides a comprehensive introduction to synthetic data, including technical details of their generation and evaluation. Our review also addresses the challenges and limitations of synthetic data, discusses practical applications, and provides thoughts for future work.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/17/2021

Bayesian Estimation of Attribute Disclosure Risks in Synthetic Data with the R Package

Synthetic data is a promising approach to privacy protection in many con...
research
08/19/2022

Synthetic Data in Human Analysis: A Survey

Deep neural networks have become prevalent in human analysis, boosting t...
research
02/05/2021

Measuring Utility and Privacy of Synthetic Genomic Data

Genomic data provides researchers with an invaluable source of informati...
research
04/04/2023

30 Years of Synthetic Data

The idea to generate synthetic data as a tool for broadening access to s...
research
03/15/2018

Strategies to facilitate access to detailed geocoding information using synthetic data

In this paper we investigate if generating synthetic data can be a viabl...
research
05/06/2022

Synthetic Data – what, why and how?

This explainer document aims to provide an overview of the current state...
research
11/02/2020

Synthetic Data Generation for Economists

As more tech companies engage in rigorous economic analyses, we are conf...

Please sign up or login with your details

Forgot password? Click here to reset