Synthetic data generation method for hybrid image-tabular data using two generative adversarial networks

08/15/2023
by   Tomohiro Kikuchi, et al.
0

The generation of synthetic medical records using generative adversarial networks (GANs) has become increasingly important for addressing privacy concerns and promoting data sharing in the medical field. In this paper, we propose a novel method for generating synthetic hybrid medical records consisting of chest X-ray images (CXRs) and structured tabular data (including anthropometric data and laboratory tests) using an auto-encoding GAN (αGAN) and a conditional tabular GAN (CTGAN). Our approach involves training a αGAN model on a large public database (pDB) to reduce the dimensionality of CXRs. We then applied the trained encoder of the GAN model to the images in original database (oDB) to obtain the latent vectors. These latent vectors were combined with tabular data in oDB, and these joint data were used to train the CTGAN model. We successfully generated diverse synthetic records of hybrid CXR and tabular data, maintaining correspondence between them. We evaluated this synthetic database (sDB) through visual assessment, distribution of interrecord distances, and classification tasks. Our evaluation results showed that the sDB captured the features of the oDB while maintaining the correspondence between the images and tabular data. Although our approach relies on the availability of a large-scale pDB containing a substantial number of images with the same modality and imaging region as those in the oDB, this method has the potential for the public release of synthetic datasets without compromising the secondary use of data.

READ FULL TEXT

page 5

page 6

page 13

page 14

research
11/10/2020

Encoding large scale cosmological structure with Generative Adversarial Networks

Recently a type of neural networks called Generative Adversarial Network...
research
10/25/2018

GAN Augmentation: Augmenting Training Data using Generative Adversarial Networks

One of the biggest issues facing the use of machine learning in medical ...
research
10/05/2022

ciDATGAN: Conditional Inputs for Tabular GANs

Conditionality has become a core component for Generative Adversarial Ne...
research
09/26/2021

Synthetic Data Generation for Fraud Detection using GANs

Detecting money laundering in gambling is becoming increasingly challeng...
research
09/06/2017

Synthetic Medical Images from Dual Generative Adversarial Networks

Currently there is strong interest in data-driven approaches to medical ...
research
09/09/2020

Anonymization of labeled TOF-MRA images for brain vessel segmentation using generative adversarial networks

Anonymization and data sharing are crucial for privacy protection and ac...
research
04/05/2018

Feedback GAN (FBGAN) for DNA: a Novel Feedback-Loop Architecture for Optimizing Protein Functions

Generative Adversarial Networks (GANs) represent an attractive and novel...

Please sign up or login with your details

Forgot password? Click here to reset