DeepAI AI Chat
Log In Sign Up

Synthetic Data Generation for Economists

11/02/2020
by   Allison Koenecke, et al.
0

As more tech companies engage in rigorous economic analyses, we are confronted with a data problem: in-house papers cannot be replicated due to use of sensitive, proprietary, or private data. Readers are left to assume that the obscured true data (e.g., internal Google information) indeed produced the results given, or they must seek out comparable public-facing data (e.g., Google Trends) that yield similar results. One way to ameliorate this reproducibility issue is to have researchers release synthetic datasets based on their true data; this allows external parties to replicate an internal researcher's methodology. In this brief overview, we explore synthetic data generation at a high level for economic analyses.

READ FULL TEXT

page 1

page 2

page 3

page 4

04/07/2023

Beyond Privacy: Navigating the Opportunities and Challenges of Synthetic Data

Generating synthetic data through generative models is gaining interest ...
04/07/2021

The Proper Use of Google Trends in Forecasting Models

It is widely known that Google Trends have become one of the most popula...
06/13/2022

Private Synthetic Data with Hierarchical Structure

We study the problem of differentially private synthetic data generation...
02/09/2022

Constructing synthetic populations in the age of big data

To develop public health intervention models using microsimulations, ext...
10/13/2022

Secure Multiparty Computation for Synthetic Data Generation from Distributed Data

Legal and ethical restrictions on accessing relevant data inhibit data s...
07/17/2021

Spatial Data Generators

This gem describes a standard method for generating synthetic spatial da...