TabDDPM: Modelling Tabular Data with Diffusion Models

09/30/2022
by Akim Kotelnikov, et al.

Denoising diffusion probabilistic models are becoming the leading paradigm of generative modeling for many important data modalities. Although most prevalent in the computer vision community, diffusion models have also recently gained attention in other domains, including speech, NLP, and graph-like data. In this work, we investigate whether the framework of diffusion models can be advantageous for general tabular problems, where datapoints are typically represented by vectors of heterogeneous features. The inherent heterogeneity of tabular data makes accurate modeling challenging, since the individual features can be of a completely different nature: some can be continuous and others discrete. To address such data types, we introduce TabDDPM, a diffusion model that can be universally applied to any tabular dataset and handles any type of feature. We extensively evaluate TabDDPM on a wide set of benchmarks and demonstrate its superiority over existing GAN/VAE alternatives, which is consistent with the advantage of diffusion models in other fields. Additionally, we show that TabDDPM is suitable for privacy-oriented setups, where the original datapoints cannot be publicly shared.
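
The abstract does not spell out how one diffusion model can cover both continuous and discrete columns. As a rough illustration only, the sketch below applies a common pairing: Gaussian noise for numerical features and a multinomial (categorical) noising step for one-hot encoded features. The linear noise schedule, the function names, and the toy data are all assumptions made for this example and are not taken from the text above.

```python
import numpy as np

def linear_alpha_bar(num_steps, beta_start=1e-4, beta_end=0.02):
    """Cumulative product of (1 - beta_t) for an assumed linear beta schedule."""
    betas = np.linspace(beta_start, beta_end, num_steps)
    return np.cumprod(1.0 - betas)

def noise_numerical(x0, alpha_bar_t, rng):
    """Gaussian forward step: x_t = sqrt(a) * x_0 + sqrt(1 - a) * eps."""
    eps = rng.standard_normal(x0.shape)
    return np.sqrt(alpha_bar_t) * x0 + np.sqrt(1.0 - alpha_bar_t) * eps

def noise_categorical(x0_onehot, alpha_bar_t, rng):
    """Multinomial forward step: sample from a * x_0 + (1 - a) / K per row."""
    k = x0_onehot.shape[-1]
    probs = alpha_bar_t * x0_onehot + (1.0 - alpha_bar_t) / k
    idx = np.array([rng.choice(k, p=p) for p in probs])
    return np.eye(k)[idx]  # back to one-hot encoding

# Toy table: two rows, two numerical columns, one 3-class categorical column.
rng = np.random.default_rng(0)
alpha_bar = linear_alpha_bar(1000)
x_num = np.array([[0.5, -1.2], [1.0, 0.3]])
x_cat = np.eye(3)[[0, 2]]

t = 500  # an arbitrary intermediate timestep
noisy_num = noise_numerical(x_num, alpha_bar[t], rng)
noisy_cat = noise_categorical(x_cat, alpha_bar[t], rng)
```

At small `t` both outputs stay close to the original row; as `t` grows, the numerical columns approach standard Gaussian noise and the categorical column approaches a uniform draw over its classes. A reverse model would then be trained to undo these steps.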

research
11/28/2022

Continuous diffusion for categorical data

Diffusion models have quickly become the go-to paradigm for generative m...
research
03/01/2023

Diffusion Probabilistic Fields

Diffusion probabilistic models have quickly become a major approach for ...
research
08/11/2023

Mirror Diffusion Models

Diffusion models have successfully been applied to generative tasks in v...
research
02/10/2023

Star-Shaped Denoising Diffusion Probabilistic Models

Methods based on Denoising Diffusion Probabilistic Models (DDPM) became ...
research
06/07/2023

A Survey on Generative Diffusion Models for Structured Data

In recent years, generative diffusion models have achieved a rapid parad...
research
06/24/2022

Source Localization of Graph Diffusion via Variational Autoencoders for Graph Inverse Problems

Graph diffusion problems such as the propagation of rumors, computer vir...
research
09/15/2023

Breathing New Life into 3D Assets with Generative Repainting

Diffusion-based text-to-image models ignited immense attention from the ...
