Effective Class-Imbalance learning based on SMOTE and Convolutional Neural Networks

Imbalanced Data (ID) is a problem that deters Machine Learning (ML) models for achieving satisfactory results. ID is the occurrence of a situation where the quantity of the samples belonging to one class outnumbers that of the other by a wide margin, making such models learning process biased towards the majority class. In recent years, to address this issue, several solutions have been put forward, which opt for either synthetically generating new data for the minority class or reducing the number of majority classes for balancing the data. Hence, in this paper, we investigate the effectiveness of methods based on Deep Neural Networks (DNNs) and Convolutional Neural Networks (CNNs), mixed with a variety of well-known imbalanced data solutions meaning oversampling and undersampling. To evaluate our methods, we have used KEEL, breast cancer, and Z-Alizadeh Sani datasets. In order to achieve reliable results, we conducted our experiments 100 times with randomly shuffled data distributions. The classification results demonstrate that the mixed Synthetic Minority Oversampling Technique (SMOTE)-Normalization-CNN outperforms different methodologies achieving 99.08 Therefore, the proposed mixed model can be applied to imbalanced binary classification problems on other real datasets.

READ FULL TEXT

page 5

page 6

page 7

page 9

page 10

page 13

page 27

page 28

research
07/06/2022

A Hybrid Approach for Binary Classification of Imbalanced Data

Binary classification with an imbalanced dataset is challenging. Models ...
research
10/09/2020

Handling Imbalanced Data: A Case Study for Binary Class Problems

For several years till date, the major issues in terms of solving for cl...
research
10/15/2017

A systematic study of the class imbalance problem in convolutional neural networks

In this study, we systematically investigate the impact of class imbalan...
research
11/19/2018

An Adaptive Oversampling Learning Method for Class-Imbalanced Fault Diagnostics and Prognostics

Data-driven fault diagnostics and prognostics suffers from class-imbalan...
research
02/04/2022

Stop Oversampling for Class Imbalance Learning: A Critical Review

For the last two decades, oversampling has been employed to overcome the...
research
07/28/2022

Deep learning for understanding multilabel imbalanced Chest X-ray datasets

Over the last few years, convolutional neural networks (CNNs) have domin...
research
01/06/2020

Consistent Batch Normalization for Weighted Loss in Imbalanced-Data Environment

In this study, we consider classification problems based on neural netwo...

Please sign up or login with your details

Forgot password? Click here to reset