Generative Modeling of Complex Data

02/04/2022
by Luca Canale, et al.

In recent years, several models have improved the capacity to generate synthetic tabular datasets. However, such models focus on synthesizing simple columnar tables and are not usable on real-life data with complex structures. This paper puts forward a generic framework to synthesize more complex data structures with composite and nested types. It then proposes one practical implementation, built with causal transformers, for structs (mappings of types) and lists (repeated instances of a type). The results on standard benchmark datasets show that this implementation consistently outperforms current state-of-the-art models in terms of both machine learning utility and statistical similarity. Moreover, it shows very strong results on two complex hierarchical datasets with multiple levels of nesting and sparse data that were previously out of reach.
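To make the terms "struct" and "list" concrete, here is a minimal Python sketch of a nested record and one possible way to serialize it into an ordered sequence of (path, value) pairs, the kind of sequence an autoregressive model could in principle generate one element at a time. This is an illustration only, not the paper's implementation: the record contents, the `flatten` helper, and the `"<len>"` marker for list length are all hypothetical.

```python
from typing import Any, Iterator, Tuple

# Hypothetical nested record: a struct (dict) holding a scalar,
# a nested struct, and a list of structs.
record = {
    "age": 42,
    "address": {"city": "Paris", "zip": "75001"},
    "purchases": [
        {"item": "book", "price": 12.5},
        {"item": "pen", "price": 1.2},
    ],
}


def flatten(value: Any, path: Tuple = ()) -> Iterator[Tuple[Tuple, Any]]:
    """Walk a nested value depth-first and yield (path, leaf) pairs.

    Structs are traversed field by field; lists first emit their length
    under an assumed "<len>" marker, then each element in order.
    """
    if isinstance(value, dict):
        for key, child in value.items():
            yield from flatten(child, path + (key,))
    elif isinstance(value, list):
        yield path + ("<len>",), len(value)
        for index, child in enumerate(value):
            yield from flatten(child, path + (index,))
    else:
        yield path, value


sequence = list(flatten(record))
for step in sequence:
    print(step)
```

Emitting the list length before its elements is one design choice among several; an end-of-list marker after the last element would serve the same purpose of telling a sequential generator when to stop.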

