Synthetic Data Generation for Fraud Detection using GANs

09/26/2021
by   Charitos Charitou, et al.
9

Detecting money laundering in gambling is becoming increasingly challenging for the gambling industry as consumers migrate to online channels. Whilst increasingly stringent regulations have been applied over the years to prevent money laundering in gambling, despite this, online gambling is still a channel for criminals to spend proceeds from crime. Complementing online gambling's growth more concerns are raised to its effects compared with gambling in traditional, physical formats, as it might introduce higher levels of problem gambling or fraudulent behaviour due to its nature of immediate interaction with online gambling experience. However, in most cases the main issue when organisations try to tackle those areas is the absence of high quality data. Since fraud detection related issues face the significant problem of the class imbalance, in this paper we propose a novel system based on Generative Adversarial Networks (GANs) for generating synthetic data in order to train a supervised classifier. Our framework Synthetic Data Generation GAN (SDG-GAN), manages to outperformed density based over-sampling methods and improve the classification performance of benchmarks datasets and the real world gambling fraud dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/29/2019

G2R Bound: A Generalization Bound for Supervised Learning from GAN-Synthetic Data

Performing supervised learning from the data synthesized by using Genera...
research
06/29/2023

Synthetic Demographic Data Generation for Card Fraud Detection Using GANs

Using machine learning models to generate synthetic data has become comm...
research
08/15/2023

Synthetic data generation method for hybrid image-tabular data using two generative adversarial networks

The generation of synthetic medical records using generative adversarial...
research
01/03/2023

On the causality-preservation capabilities of generative modelling

Modeling lies at the core of both the financial and the insurance indust...
research
01/13/2023

Short-time SSVEP data extension by a novel generative adversarial networks based framework

Steady-state visual evoked potentials (SSVEPs) based brain-computer inte...
research
09/10/2019

Sampling Strategies for GAN Synthetic Data

Generative Adversarial Networks (GANs) have been used widely to generate...
research
08/02/2023

Exploiting Synthetic Data for Data Imbalance Problems: Baselines from a Data Perspective

We live in a vast ocean of data, and deep neural networks are no excepti...

Please sign up or login with your details

Forgot password? Click here to reset