SMOTified-GAN for class imbalanced pattern classification problems

08/06/2021
by   Anuraganand Sharma, et al.
0

Class imbalance in a dataset is a major problem for classifiers that results in poor prediction with a high true positive rate (TPR) but a low true negative rate (TNR) for a majority positive training dataset. Generally, the pre-processing technique of oversampling of minority class(es) are used to overcome this deficiency. Our focus is on using the hybridization of Generative Adversarial Network (GAN) and Synthetic Minority Over-Sampling Technique (SMOTE) to address class imbalanced problems. We propose a novel two-phase oversampling approach that has the synergy of SMOTE and GAN. The initial data of minority class(es) generated by SMOTE is further enhanced by GAN that produces better quality samples. We named it SMOTified-GAN as GAN works on pre-sampled minority data produced by SMOTE rather than randomly generating the samples itself. The experimental results prove the sample quality of minority class(es) has been improved in a variety of tested benchmark datasets. Its performance is improved by up to 9% from the next best algorithm tested on F1-score measurements. Its time complexity is also reasonable which is around O(N^2d^2T) for a sequential algorithm.

READ FULL TEXT
research
10/23/2022

Imbalanced Class Data Performance Evaluation and Improvement using Novel Generative Adversarial Network-based Approach: SSG and GBO

Class imbalance in a dataset is one of the major challenges that can sig...
research
01/27/2022

FinGAN: Generative Adversarial Network for Analytical Customer Relationship Management in Banking and Insurance

Churn prediction in credit cards, fraud detection in insurance, and loan...
research
05/16/2023

BSGAN: A Novel Oversampling Technique for Imbalanced Pattern Recognitions

Class imbalanced problems (CIP) are one of the potential challenges in d...
research
11/18/2022

How to train your draGAN: A task oriented solution to imbalanced classification

The long-standing challenge of building effective classification models ...
research
04/19/2022

Imbalanced Classification via a Tabular Translation GAN

When presented with a binary classification problem where the data exhib...
research
09/28/2020

Balancing thermal comfort datasets: We GAN, but should we?

Thermal comfort assessment for the built environment has become more ava...
research
03/22/2020

Deep Synthetic Minority Over-Sampling Technique

Synthetic Minority Over-sampling Technique (SMOTE) is the most popular o...

Please sign up or login with your details

Forgot password? Click here to reset