Synthetic Demographic Data Generation for Card Fraud Detection Using GANs

06/29/2023
by   Shuo Wang, et al.
0

Using machine learning models to generate synthetic data has become common in many fields. Technology to generate synthetic transactions that can be used to detect fraud is also growing fast. Generally, this synthetic data contains only information about the transaction, such as the time, place, and amount of money. It does not usually contain the individual user's characteristics (age and gender are occasionally included). Using relatively complex synthetic demographic data may improve the complexity of transaction data features, thus improving the fraud detection performance. Benefiting from developments of machine learning, some deep learning models have potential to perform better than other well-established synthetic data generation methods, such as microsimulation. In this study, we built a deep-learning Generative Adversarial Network (GAN), called DGGAN, which will be used for demographic data generation. Our model generates samples during model training, which we found important to overcame class imbalance issues. This study can help improve the cognition of synthetic data and further explore the application of synthetic data generation in card fraud detection.

READ FULL TEXT
research
03/11/2022

FedSyn: Synthetic Data Generation using Federated Learning

As Deep Learning algorithms continue to evolve and become more sophistic...
research
10/16/2022

Comparing Synthetic Tabular Data Generation Between a Probabilistic Model and a Deep Learning Model for Education Use Cases

The ability to generate synthetic data has a variety of use cases across...
research
03/02/2023

Analyzing Effects of Fake Training Data on the Performance of Deep Learning Systems

Deep learning models frequently suffer from various problems such as cla...
research
05/26/2023

Gender, Smoking History and Age Prediction from Laryngeal Images

Flexible laryngoscopy is commonly performed by otolaryngologists to dete...
research
09/26/2021

Synthetic Data Generation for Fraud Detection using GANs

Detecting money laundering in gambling is becoming increasingly challeng...
research
03/06/2022

Hybrid Deep Learning Model using SPCAGAN Augmentation for Insider Threat Analysis

Cyberattacks from within an organization's trusted entities are known as...
research
01/03/2021

Synthetic Embedding-based Data Generation Methods for Student Performance

Given the inherent class imbalance issue within student performance data...

Please sign up or login with your details

Forgot password? Click here to reset