Generating Synthetic Clinical Data that Capture Class Imbalanced Distributions with Generative Adversarial Networks: Example using Antiretroviral Therapy for HIV

08/18/2022
by   Nicholas I-Hsien Kuo, et al.
0

Clinical data usually cannot be freely distributed due to their highly confidential nature and this hampers the development of machine learning in the healthcare domain. One way to mitigate this problem is by generating realistic synthetic datasets using generative adversarial networks (GANs). However, GANs are known to suffer from mode collapse and thus creating outputs of low diveristy. In this paper, we extend the classic GAN setup with an external memory to replay features from real samples. Using antiretroviral therapy for human immunodeficiency virus (ART for HIV) as a case study, we show that our extended setup increases convergence and more importantly, it is effective in capturing the severe class imbalanced distributions common to real world clinical data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/05/2021

VARGAN: Variance Enforcing Network Enhanced GAN

Generative adversarial networks (GANs) are one of the most widely used g...
research
08/20/2020

Conditional Wasserstein GAN-based Oversampling of Tabular Data for Imbalanced Learning

Class imbalance is a common problem in supervised learning and impedes t...
research
11/07/2018

Forging new worlds: high-resolution synthetic galaxies with chained generative adversarial networks

Astronomy of the 21st century finds itself with extreme quantities of da...
research
07/04/2018

Generating Synthetic but Plausible Healthcare Record Datasets

Generating datasets that "look like" given real ones is an interesting t...
research
04/09/2023

Distributed Conditional GAN (discGAN) For Synthetic Healthcare Data Generation

In this paper, we propose a distributed Generative Adversarial Networks ...
research
03/22/2022

Dazzle: Using Optimized Generative Adversarial Networks to Address Security Data Class Imbalance Issue

Background: Machine learning techniques have been widely used and demons...
research
08/04/2022

CIGAN: A Python Package for Handling Class Imbalance using Generative Adversarial Networks

A key challenge in Machine Learning is class imbalance, where the sample...

Please sign up or login with your details

Forgot password? Click here to reset