Two-Phase Data Synthesis for Income: An Application to the NHIS

06/02/2020
by   Kevin Ros, et al.
0

We propose a two-phase synthesis process for synthesizing income, a sensitive variable which is usually highly-skewed and has a number of reported zeros. We consider two forms of a continuous income variable: a binary form, which is modeled and synthesized in phase 1; and a non-negative continuous form, which is modeled and synthesized in phase 2. Bayesian synthesis models are proposed for the two-phase synthesis process, and other synthesis models such as classification and regression trees (CART) are readily implementable. We demonstrate our methods as applications to a sample from the National Health Interview Survey (NHIS). Utility and risk profiles of generated synthetic datasets are evaluated and compared to results from a single-phase synthesis process.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/01/2020

Identification Risk Evaluation of Continuous Synthesized Variables

We propose a general approach to evaluating identification risk of conti...
research
07/11/2006

Interactive Hatching and Stippling by Example

We describe a system that lets a designer interactively draw patterns of...
research
10/22/2020

How Similar or Different Is Rakugo Speech Synthesizer to Professional Performers?

We have been working on speech synthesis for rakugo (a traditional Japan...
research
09/17/2021

Data Privacy Protection and Utility Preservation through Bayesian Data Synthesis: A Case Study on Airbnb Listings

When releasing record-level data containing sensitive information to the...
research
09/14/2020

Predictive Synthesis of Quantum Materials by Probabilistic Reinforcement Learning

Predictive materials synthesis is the primary bottleneck in realizing ne...
research
09/21/2021

An Audio Synthesis Framework Derived from Industrial Process Control

Since its conception, digital synthesis has significantly influenced the...

Please sign up or login with your details

Forgot password? Click here to reset