Box Drawings for Learning with Imbalanced Data

03/13/2014
by   Siong Thye Goh, et al.
0

The vast majority of real world classification problems are imbalanced, meaning there are far fewer data from the class of interest (the positive class) than from other classes. We propose two machine learning algorithms to handle highly imbalanced classification problems. The classifiers constructed by both methods are created as unions of parallel axis rectangles around the positive examples, and thus have the benefit of being interpretable. The first algorithm uses mixed integer programming to optimize a weighted balance between positive and negative class accuracies. Regularization is introduced to improve generalization performance. The second method uses an approximation in order to assist with scalability. Specifically, it follows a characterize then discriminate approach, where the positive class is characterized first by boxes, and then each box boundary becomes a separate discriminative classifier. This method has the computational advantages that it can be easily parallelized, and considers only the relevant regions of feature space.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/25/2018

Superensemble Classifier for Improving Predictions in Imbalanced Datasets

Learning from an imbalanced dataset is a tricky proposition. Because the...
research
12/30/2021

The SAMME.C2 algorithm for severely imbalanced multi-class classification

Classification predictive modeling involves the accurate assignment of o...
research
11/21/2018

Adversarial Classifier for Imbalanced Problems

Adversarial approach has been widely used for data generation in the las...
research
11/10/2022

Review of Methods for Handling Class-Imbalanced in Classification Problems

Learning classifiers using skewed or imbalanced datasets can occasionall...
research
07/08/2020

Remix: Rebalanced Mixup

Deep image classifiers often perform poorly when training data are heavi...
research
05/23/2022

Throwing Away Data Improves Worst-Class Error in Imbalanced Classification

Class imbalances pervade classification problems, yet their treatment di...
research
05/05/2022

Automated Imbalanced Classification via Layered Learning

In this paper we address imbalanced binary classification (IBC) tasks. A...

Please sign up or login with your details

Forgot password? Click here to reset