Synthetic Data Generation for Economists

11/02/2020
by   Allison Koenecke, et al.
0

As more tech companies engage in rigorous economic analyses, we are confronted with a data problem: in-house papers cannot be replicated due to use of sensitive, proprietary, or private data. Readers are left to assume that the obscured true data (e.g., internal Google information) indeed produced the results given, or they must seek out comparable public-facing data (e.g., Google Trends) that yield similar results. One way to ameliorate this reproducibility issue is to have researchers release synthetic datasets based on their true data; this allows external parties to replicate an internal researcher's methodology. In this brief overview, we explore synthetic data generation at a high level for economic analyses.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/07/2023

Beyond Privacy: Navigating the Opportunities and Challenges of Synthetic Data

Generating synthetic data through generative models is gaining interest ...
research
04/07/2021

The Proper Use of Google Trends in Forecasting Models

It is widely known that Google Trends have become one of the most popula...
research
08/01/2023

Advancing Microdata Privacy Protection: A Review of Synthetic Data

Synthetic data generation is a powerful tool for privacy protection when...
research
02/09/2022

Constructing synthetic populations in the age of big data

To develop public health intervention models using microsimulations, ext...
research
05/06/2022

Synthetic Data – what, why and how?

This explainer document aims to provide an overview of the current state...
research
07/17/2021

Spatial Data Generators

This gem describes a standard method for generating synthetic spatial da...

Please sign up or login with your details

Forgot password? Click here to reset