Comparing Synthetic Tabular Data Generation Between a Probabilistic Model and a Deep Learning Model for Education Use Cases

10/16/2022
by   Herkulaas MvE Combrink, et al.
0

The ability to generate synthetic data has a variety of use cases across different domains. In education research, there is a growing need to have access to synthetic data to test certain concepts and ideas. In recent years, several deep learning architectures were used to aid in the generation of synthetic data but with varying results. In the education context, the sophistication of implementing different models requiring large datasets is becoming very important. This study aims to compare the application of synthetic tabular data generation between a probabilistic model specifically a Bayesian Network, and a deep learning model, specifically a Generative Adversarial Network using a classification task. The results of this study indicate that synthetic tabular data generation is better suited for the education context using probabilistic models (overall accuracy of 75 deep learning architecture (overall accuracy of 38 interdependence. Lastly, we recommend that other data types, should be explored and evaluated for their application in generating synthetic data for education use cases.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/29/2023

Synthetic Demographic Data Generation for Card Fraud Detection Using GANs

Using machine learning models to generate synthetic data has become comm...
research
06/02/2023

Generation of Probabilistic Synthetic Data for Serious Games: A Case Study on Cyberbullying

Synthetic data generation has been a growing area of research in recent ...
research
06/22/2020

Improving LIME Robustness with Smarter Locality Sampling

Explainability algorithms such as LIME have enabled machine learning sys...
research
04/26/2021

Synthetic 3D Data Generation Pipeline for Geometric Deep Learning in Architecture

With the growing interest in deep learning algorithms and computational ...
research
07/28/2022

Sequential Models in the Synthetic Data Vault

The goal of this paper is to describe a system for generating synthetic ...
research
03/06/2022

Hybrid Deep Learning Model using SPCAGAN Augmentation for Insider Threat Analysis

Cyberattacks from within an organization's trusted entities are known as...
research
09/27/2020

STAN: Synthetic Network Traffic Generation using Autoregressive Neural Models

Deep learning models have achieved great success in recent years. Howeve...

Please sign up or login with your details

Forgot password? Click here to reset