Superensemble Classifier for Improving Predictions in Imbalanced Datasets

10/25/2018
by Tanujit Chakraborty, et al.

Learning from an imbalanced dataset is difficult. Because such datasets are skewed towards one class, most existing classifiers perform poorly on minority-class examples. Conventional classifiers usually optimize overall accuracy without considering the relative distribution of each class. This article presents a superensemble classifier for imbalanced classification problems that maps Hellinger distance decision trees (HDDT) into a radial basis function network (RBFN) framework. Regularity conditions for universal consistency and an approach to parameter optimization of the proposed model are provided. The proposed distribution-free model can be applied to problems that combine feature selection with imbalanced classification. Numerical evidence on various real-life data sets is provided to assess the performance of the proposed model, and it shows that the model is effective and competitive with different state-of-the-art models.
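
To illustrate why Hellinger distance suits imbalanced data, the following is a minimal sketch of the Hellinger distance split criterion commonly used in HDDT-style trees, for a binary split and binary labels. It is not the authors' implementation; the function name `hellinger_split_score` and the encoding (label 1 for the minority class, 0 for the majority class, a boolean per example for the branch taken) are assumptions made for this example. The score compares how each class distributes across the two branches, so it depends only on within-class proportions and not on the class ratio.

```python
import math


def hellinger_split_score(labels, goes_left):
    """Hellinger distance between the two class-conditional branch distributions.

    labels: iterable of 0/1 class labels (assumed encoding: 1 = minority class).
    goes_left: iterable of booleans, True if the example falls in the left branch.
    Returns a value in [0, sqrt(2)]; larger means the split separates the classes better.
    """
    labels = list(labels)
    goes_left = list(goes_left)
    n_pos = sum(1 for y in labels if y == 1)
    n_neg = len(labels) - n_pos
    if n_pos == 0 or n_neg == 0:
        return 0.0  # only one class present, so no split can separate anything

    total = 0.0
    for branch in (True, False):
        pos_in = sum(1 for y, b in zip(labels, goes_left) if y == 1 and b == branch)
        neg_in = sum(1 for y, b in zip(labels, goes_left) if y == 0 and b == branch)
        # Compare the fraction of each class that lands in this branch.
        total += (math.sqrt(pos_in / n_pos) - math.sqrt(neg_in / n_neg)) ** 2
    return math.sqrt(total)


# A split that isolates the minority class, evaluated at two levels of imbalance.
mild = hellinger_split_score([1, 1, 0, 0, 0, 0], [True, True, False, False, False, False])
severe = hellinger_split_score([1, 1] + [0] * 40, [True, True] + [False] * 40)
print(mild, severe)
```

Because only the within-class fractions enter the formula, adding more majority-class examples with the same branch proportions leaves the score unchanged, which is the property that makes this criterion attractive for skewed class distributions.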

research
05/31/2018

Superensemble classifier for learning from imbalanced business school data set

Private business schools in India face a common problem of selecting qua...
research
04/29/2018

A Nonparametric Ensemble Binary Classifier and its Statistical Properties

In this work, we propose an ensemble of classification trees (CT) and ar...
research
10/28/2022

Improving Multi-class Classifier Using Likelihood Ratio Estimation with Regularization

The universal-set naive Bayes classifier (UNB) <cit.>, defined using lik...
research
03/13/2014

Box Drawings for Learning with Imbalanced Data

The vast majority of real world classification problems are imbalanced, ...
research
04/19/2018

Instance Selection Improves Geometric Mean Accuracy: A Study on Imbalanced Data Classification

A natural way of handling imbalanced data is to attempt to equalise the ...
research
07/07/2018

Synthetic Sampling for Multi-Class Malignancy Prediction

We explore several oversampling techniques for an imbalanced multi-label...
