Superensemble classifier for learning from imbalanced business school data set

05/31/2018
by   Tanujit Chakraborty, et al.
0

Private business schools in India face a common problem of selecting quality students for their MBA programs to achieve desired placement percentage. Business school data set is biased towards one class, i.e., imbalanced in nature. And learning from imbalanced data set is a difficult proposition. Most existing classification methods tend not to perform well on minority class examples when the data set is extremely imbalanced, because they aim to optimize the overall accuracy without considering the relative distribution of each class. The aim of the paper is twofold. We first propose an integrated sampling technique with an ensemble of classification tree (CT) and artificial neural network (ANN) model as one of the methodologies which works better compared to other similar methods. Further we propose a superensemble imbalanced classifier which works better on the original business school data set. Our proposed superensemble classifier not only handles the imbalance data set but also achieves higher accuracy in case of feature selection cum classification problems. The proposal has been compared with other state-of-the-art classifiers and found to be very competitive.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/25/2018

Superensemble Classifier for Improving Predictions in Imbalanced Datasets

Learning from an imbalanced dataset is a tricky proposition. Because the...
research
04/06/2021

Survey of Imbalanced Data Methodologies

Imbalanced data set is a problem often found and well-studied in financi...
research
09/18/2021

An Empirical Evaluation of the t-SNE Algorithm for Data Visualization in Structural Engineering

A fundamental task in machine learning involves visualizing high-dimensi...
research
04/29/2021

Recognition and Processing of NATOM

In this paper we show how to process the NOTAM (Notice to Airmen) data o...
research
04/29/2018

A Nonparametric Ensemble Binary Classifier and its Statistical Properties

In this work, we propose an ensemble of classification trees (CT) and ar...
research
11/25/2019

Improvement of Batch Normalization in Imbalanced Data

In this study, we consider classification problems based on neural netwo...
research
11/19/2017

Lung nodule classification by THE combination of Fusion classifier and Cascaded Convolutional Neural Networks

Lung nodule classification is a class imbalanced problem, as nodules are...

Please sign up or login with your details

Forgot password? Click here to reset