STaSy: Score-based Tabular data Synthesis

10/08/2022
by Jayoung Kim, et al.

Tabular data synthesis is a long-standing research topic in machine learning. Many methods have been proposed over the past decades, ranging from statistical methods to deep generative methods. However, these methods have not always succeeded because of the complicated nature of real-world tabular data. In this paper, we present a new model named Score-based Tabular data Synthesis (STaSy) and its training strategy, based on the paradigm of score-based generative modeling. Although score-based generative models have resolved many issues in generative modeling, there is still room for improvement in tabular data synthesis. Our proposed training strategy includes a self-paced learning technique and a fine-tuning strategy, which together further increase sampling quality and diversity by stabilizing denoising score matching training. Furthermore, we conduct rigorous experimental studies in terms of the generative task trilemma: sampling quality, diversity, and sampling time. In our experiments with 15 benchmark tabular datasets and 7 baselines, our method outperforms existing methods in terms of task-dependent evaluations and diversity.
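To make the two training ingredients named in the abstract more concrete, the following is a minimal sketch of a denoising score matching loss combined with a generic hard-threshold self-paced weighting. It is an illustration under stated assumptions, not the paper's implementation: the function names (`dsm_losses`, `self_paced_weights`, `self_paced_objective`, `gaussian_score`), the single fixed noise level `sigma`, and the hard 0/1 weighting rule are all hypothetical choices made here, and the actual STaSy weighting scheme may differ.

```python
import random


def gaussian_score(x_noisy, sigma):
    """Toy score function (assumption): exact score of N(0, (1 + sigma^2) I),
    i.e. standard-normal data perturbed with Gaussian noise of scale sigma."""
    return [-xi / (1.0 + sigma ** 2) for xi in x_noisy]


def dsm_losses(score_fn, data, sigma, rng):
    """Per-sample denoising score matching losses at one noise level.

    Each record x is perturbed as x_noisy = x + sigma * z with z ~ N(0, I).
    The regression target for the score is -z / sigma, and the loss is
    weighted by sigma^2, giving ||sigma * s(x_noisy) + z||^2.
    """
    losses = []
    for x in data:
        z = [rng.gauss(0.0, 1.0) for _ in x]
        x_noisy = [xi + sigma * zi for xi, zi in zip(x, z)]
        s = score_fn(x_noisy, sigma)
        losses.append(sum((sigma * si + zi) ** 2 for si, zi in zip(s, z)))
    return losses


def self_paced_weights(losses, threshold):
    """Hard self-paced weights (assumption): keep records whose loss is at
    most the threshold, drop the rest for this training step."""
    return [1.0 if loss <= threshold else 0.0 for loss in losses]


def self_paced_objective(losses, threshold):
    """Mean loss over the records selected by the self-paced weights."""
    weights = self_paced_weights(losses, threshold)
    kept = sum(weights)
    if kept == 0:
        return 0.0
    return sum(w * loss for w, loss in zip(weights, losses)) / kept


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical toy "table": 8 records with 3 numeric columns each.
    data = [[rng.gauss(0.0, 1.0) for _ in range(3)] for _ in range(8)]
    losses = dsm_losses(gaussian_score, data, sigma=0.5, rng=rng)
    print(self_paced_objective(losses, threshold=3.0))
```

In a self-paced schedule the threshold would typically be raised over the course of training so that harder records are admitted gradually; a later fine-tuning phase could then train on all records. How the threshold is scheduled is left open here.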

research
01/20/2023

Regular Time-series Generation using SGM

Score-based generative models (SGMs) are generative models that are in t...
research
06/17/2022

SOS: Score-based Oversampling for Tabular Data

Score-based generative models (SGMs) are a recent breakthrough in genera...
research
06/29/2022

SPI-GAN: Distilling Score-based Generative Models with Straight-Path Interpolations

Score-based generative models (SGMs) are a recently proposed paradigm fo...
research
04/08/2021

On tuning consistent annealed sampling for denoising score matching

Score-based generative models provide state-of-the-art quality for image...
research
06/08/2022

Accelerating Score-based Generative Models for High-Resolution Image Synthesis

Score-based generative models (SGMs) have recently emerged as a promisin...
research
09/26/2022

Quasi-Conservative Score-based Generative Models

Existing Score-based Generative Models (SGMs) can be categorized into co...
research
10/26/2022

Full-band General Audio Synthesis with Score-based Diffusion

Recent works have shown the capability of deep generative models to tack...
