Distributed Conditional GAN (discGAN) For Synthetic Healthcare Data Generation

04/09/2023
by   David Fuentes, et al.
0

In this paper, we propose a distributed Generative Adversarial Networks (discGANs) to generate synthetic tabular data specific to the healthcare domain. While using GANs to generate images has been well studied, little to no attention has been given to generation of tabular data. Modeling distributions of discrete and continuous tabular data is a non-trivial task with high utility. We applied discGAN to model non-Gaussian multi-modal healthcare data. We generated 249,000 synthetic records from original 2,027 eICU dataset. We evaluated the performance of the model using machine learning efficacy, the Kolmogorov-Smirnov (KS) test for continuous variables and chi-squared test for discrete variables. Our results show that discGAN was able to generate data with distributions similar to the real data.

READ FULL TEXT
research
06/01/2018

Natural Language Generation for Electronic Health Records

A variety of methods existing for generating synthetic electronic health...
research
01/25/2020

COR-GAN: Correlation-Capturing Convolutional Neural Networks for Generating Synthetic Healthcare Records

Deep learning models have demonstrated high-quality performance in areas...
research
07/01/2019

Modeling Tabular data using Conditional GAN

Modeling the probability distribution of rows in tabular data and genera...
research
12/12/2017

Logo Synthesis and Manipulation with Clustered Generative Adversarial Networks

Designing a logo for a new brand is a lengthy and tedious back-and-forth...
research
09/02/2021

TabFairGAN: Fair Tabular Data Generation with Generative Adversarial Networks

With the increasing reliance on automated decision making, the issue of ...
research
10/31/2019

Co-Generation with GANs using AIS based HMC

Inferring the most likely configuration for a subset of variables of a j...

Please sign up or login with your details

Forgot password? Click here to reset