Generation and Simulation of Synthetic Datasets with Copulas

03/30/2022
by   Régis Houssou, et al.
0

This paper proposes a new method to generate synthetic data sets based on copula models. Our goal is to produce surrogate data resembling real data in terms of marginal and joint distributions. We present a complete and reliable algorithm for generating a synthetic data set comprising numeric or categorical variables. Applying our methodology to two datasets shows better performance compared to other methods such as SMOTE and autoencoders.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/11/2015

Learning Classifiers from Synthetic Data Using a Multichannel Autoencoder

We propose a method for using synthetic data to help learning classifier...
research
09/27/2021

Derivative Extrapolation Using Least Squares

Here, we present three methods for differentiating discrete sets from st...
research
07/17/2021

A new Method for the in vivo identification of material properties of the human eye. Feasibility analysis based on synthetic data

This paper proposes a new method for in vivo and almost real-time identi...
research
12/18/2013

Perturbed Gibbs Samplers for Synthetic Data Release

We propose a categorical data synthesizer with a quantifiable disclosure...
research
11/13/2021

HydraGAN A Multi-head, Multi-objective Approach to Synthetic Data Generation

Synthetic data generation overcomes limitations of real-world machine le...

Please sign up or login with your details

Forgot password? Click here to reset