Binary Classification: Counterbalancing Class Imbalance by Applying Regression Models in Combination with One-Sided Label Shifts

11/30/2020
by Peter Bellmann, et al.

In many real-world pattern recognition scenarios, such as medical applications, the classification task is imbalanced. In this study, we focus on imbalanced binary classification, i.e. tasks in which one of the two classes (the minority class) is under-represented in comparison to the other (the majority class). Many approaches to countering class imbalance have been proposed in the literature, such as under- and oversampling. In this work, we introduce a novel method that addresses class imbalance. To this end, we first transfer the binary classification task to an equivalent regression task. Subsequently, we generate a set of negative and positive target labels such that the corresponding regression task becomes balanced with respect to the redefined target label set. We evaluate our approach on a number of publicly available data sets in combination with Support Vector Machines. Moreover, we compare our proposed method to one of the most popular oversampling techniques, SMOTE. Based on a detailed discussion of the outcomes of our experimental evaluation, we provide promising ideas for future research directions.
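The abstract does not spell out how the balanced target labels are constructed, so the following is only a minimal sketch of one possible reading, not the authors' procedure. It assumes (hypothetically) that the majority class keeps the target -1 while the minority class is shifted to +n_maj/n_min, so that the regression targets sum to zero, and that a sample is assigned to the minority class when the regressor's output is positive. The function names `one_sided_shifted_targets` and `predict_class` are illustrative only.

```python
from collections import Counter


def one_sided_shifted_targets(labels, minority=1):
    """Map binary class labels to regression targets.

    Assumption (not taken from the paper): majority samples get -1.0 and
    minority samples get +n_maj / n_min, so all targets sum to zero.
    """
    counts = Counter(labels)
    n_min = counts[minority]
    n_maj = len(labels) - n_min
    if n_min == 0 or n_maj == 0:
        raise ValueError("both classes must be present")
    shift = n_maj / n_min  # one-sided shift applied to the minority label only
    return [shift if y == minority else -1.0 for y in labels]


def predict_class(regression_output, minority=1, majority=0):
    """Turn a regression output back into a class label by thresholding at 0."""
    return minority if regression_output > 0 else majority


# Toy data: 8 majority samples (label 0) and 2 minority samples (label 1).
labels = [0] * 8 + [1] * 2
targets = one_sided_shifted_targets(labels)
print(targets)       # eight -1.0 entries followed by two 4.0 entries
print(sum(targets))  # 0.0, i.e. balanced under this assumed construction
```

Under these assumptions, `targets` could be passed to any regressor in place of the class labels, for example a support vector regressor such as scikit-learn's `SVR`, with `predict_class` applied to its outputs.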

research
07/05/2021

Statistical Theory for Imbalanced Binary Classification

Within the vast body of statistical theory developed for binary classifi...
research
05/23/2021

A Study imbalance handling by various data sampling methods in binary classification

The purpose of this research report is to present our learning curve...
research
07/27/2023

Retrieval-based Text Selection for Addressing Class-Imbalanced Data in Classification

This paper addresses the problem of selecting of a set of texts for anno...
research
08/05/2019

Imbalance-XGBoost: Leveraging Weighted and Focal Losses for Binary Label-Imbalanced Classification with XGBoost

The paper presents Imbalance-XGBoost, a Python package that combines the...
research
06/14/2018

Analysis of the Effect of Unexpected Outliers in the Classification of Spectroscopy Data

Multi-class classification algorithms are very widely used, but we argue...
research
06/14/2019

Binary Classification using Pairs of Minimum Spanning Trees or N-ary Trees

One-class classifiers are trained with target class only samples. Intuit...
research
07/12/2017

Influence of Resampling on Accuracy of Imbalanced Classification

In many real-world binary classification tasks (e.g. detection of certai...
