Causal-TGAN: Generating Tabular Data Using Causal Generative Adversarial Networks

04/21/2021
by   Bingyang Wen, et al.
0

Synthetic data generation becomes prevalent as a solution to privacy leakage and data shortage. Generative models are designed to generate a realistic synthetic dataset, which can precisely express the data distribution for the real dataset. The generative adversarial networks (GAN), which gain great success in the computer vision fields, are doubtlessly used for synthetic data generation. Though there are prior works that have demonstrated great progress, most of them learn the correlations in the data distributions rather than the true processes in which the datasets are naturally generated. Correlation is not reliable for it is a statistical technique that only tells linear dependencies and is easily affected by the dataset's bias. Causality, which encodes all underlying factors of how the real data be naturally generated, is more reliable than correlation. In this work, we propose a causal model named Causal Tabular Generative Neural Network (Causal-TGAN) to generate synthetic tabular data using the tabular data's causal information. Extensive experiments on both simulated datasets and real datasets demonstrate the better performance of our method when given the true causal graph and a comparable performance when using the estimated causal graph.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/19/2023

Probabilistic matching of real and generated data statistics in generative adversarial networks

Generative adversarial networks constitute a powerful approach to genera...
research
01/03/2023

On the causality-preservation capabilities of generative modelling

Modeling lies at the core of both the financial and the insurance indust...
research
07/01/2019

Modeling Tabular data using Conditional GAN

Modeling the probability distribution of rows in tabular data and genera...
research
07/30/2021

Synthetic flow-based cryptomining attack generation through Generative Adversarial Networks

Due to the growing rise of cyber attacks in the Internet, flow-based dat...
research
10/05/2021

Top-N: Equivariant set and graph generation without exchangeability

We consider one-shot probabilistic decoders that map a vector-shaped pri...
research
07/01/2023

CasTGAN: Cascaded Generative Adversarial Network for Realistic Tabular Data Synthesis

Generative adversarial networks (GANs) have drawn considerable attention...
research
06/03/2022

Causality Learning With Wasserstein Generative Adversarial Networks

Conventional methods for causal structure learning from data face signif...

Please sign up or login with your details

Forgot password? Click here to reset