Differentiable and Scalable Generative Adversarial Models for Data Imputation

01/10/2022
by Yangyang Wu, et al.

Data imputation has been extensively explored to solve the missing data problem. The dramatically increasing volume of incomplete data makes the imputation models computationally infeasible in many real-life applications. In this paper, we propose an effective scalable imputation system named SCIS to significantly speed up the training of differentiable generative adversarial imputation models under accuracy guarantees for large-scale incomplete data. SCIS consists of two modules: differentiable imputation modeling (DIM) and sample size estimation (SSE). DIM leverages a new masking Sinkhorn divergence function to make an arbitrary generative adversarial imputation model differentiable; for such a differentiable imputation model, SSE can estimate an appropriate sample size to ensure the user-specified imputation accuracy of the final model. Extensive experiments on several real-life large-scale datasets demonstrate that our proposed system can accelerate generative adversarial model training by 7.1x. Using around 7.6% of the samples, SCIS yields accuracy competitive with state-of-the-art imputation methods in a much shorter computation time.
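To make the idea of a masking Sinkhorn divergence more concrete, here is a minimal NumPy sketch. It is not the authors' implementation. It assumes one plausible reading of the abstract: the divergence is a standard debiased Sinkhorn divergence computed between the imputed batch and the original batch after both are multiplied by the observation mask, so only observed entries contribute. The function names, the squared Euclidean cost, and the parameters `eps` and `n_iters` are illustrative choices.

```python
import numpy as np


def _logsumexp(a, axis):
    # Numerically stable log(sum(exp(a))) along one axis.
    m = np.max(a, axis=axis, keepdims=True)
    return np.squeeze(m, axis=axis) + np.log(np.sum(np.exp(a - m), axis=axis))


def entropic_ot(x, y, eps=1.0, n_iters=200):
    """Entropy-regularised OT value between two uniformly weighted point
    clouds, estimated with log-domain Sinkhorn iterations."""
    n, m = len(x), len(y)
    # Pairwise cost: half the squared Euclidean distance (an assumption).
    cost = 0.5 * np.sum((x[:, None, :] - y[None, :, :]) ** 2, axis=-1)
    log_a = np.full(n, -np.log(n))
    log_b = np.full(m, -np.log(m))
    f = np.zeros(n)
    g = np.zeros(m)
    for _ in range(n_iters):
        # Alternating soft-min updates of the two dual potentials.
        f = -eps * _logsumexp((g[None, :] - cost) / eps + log_b[None, :], axis=1)
        g = -eps * _logsumexp((f[:, None] - cost) / eps + log_a[:, None], axis=0)
    # Dual objective value at the current potentials.
    return np.mean(f) + np.mean(g)


def masked_sinkhorn_divergence(x_hat, x, mask, eps=1.0, n_iters=200):
    """Debiased Sinkhorn divergence restricted to observed entries.

    mask is a boolean array, True where a value is observed. Missing
    entries are zeroed in both batches (hypothetical masking scheme).
    """
    xm = np.where(mask, x, 0.0)
    x_hat_m = np.where(mask, x_hat, 0.0)
    cross = entropic_ot(x_hat_m, xm, eps, n_iters)
    self_hat = entropic_ot(x_hat_m, x_hat_m, eps, n_iters)
    self_x = entropic_ot(xm, xm, eps, n_iters)
    return cross - 0.5 * self_hat - 0.5 * self_x
```

Under these assumptions, an imputation that reproduces every observed value gives a divergence of zero regardless of what it places in the missing cells, while errors on observed cells increase it. Every step is a smooth operation, which is what would let such a loss be backpropagated through an imputation network in an autodiff framework.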

research
12/23/2020

IFGAN: Missing Value Imputation using Feature-specific Generative Adversarial Networks

Missing value imputation is a challenging and well-researched topic in d...
research
03/09/2022

FragmGAN: Generative Adversarial Nets for Fragmentary Data Imputation and Prediction

Modern scientific research and applications very often encounter "fragme...
research
03/19/2023

Generative Adversarial Classification Network with Application to Network Traffic Classification

Large datasets in machine learning often contain missing data, which nec...
research
07/31/2021

Missingness Augmentation: A General Approach for Improving Generative Imputation Models

Despite tremendous progress in missing data imputation task, designing n...
research
02/06/2023

ClueGAIN: Application of Transfer Learning On Generative Adversarial Imputation Nets (GAIN)

Many studies have attempted to solve the problem of missing data using v...
research
08/22/2017

VIGAN: Missing View Imputation with Generative Adversarial Networks

In an era when big data are becoming the norm, there is less concern wit...
research
01/26/2022

Generative Trees: Adversarial and Copycat

While Generative Adversarial Networks (GANs) achieve spectacular results...
