Towards Competitive Classifiers for Unbalanced Classification Problems: A Study on the Performance Scores

Although considerable methodological effort has been invested in proposing competitive solutions to the class-imbalance problem, little effort has gone into a theoretical understanding of the matter. To shed some light on this topic, we use a novel framework to perform an exhaustive analysis of the adequacy of the most commonly used performance scores for assessing this complex scenario. We conclude that using unweighted Hölder means with exponent p ≤ 1 to average the recalls of all the classes produces adequate scores, which are capable of determining whether a classifier is competitive. We then review the major solutions presented in the class-imbalance literature. Since any learning task can be defined as an optimisation problem in which a loss function, usually connected to a particular score, is minimised, our goal is to determine whether the learning tasks found in the literature are also oriented towards maximising the previously identified adequate scores. We conclude that they usually maximise the unweighted Hölder mean with p = 1 (a-mean). Finally, we provide bounds on the values of the studied performance scores which guarantee a classifier with a higher recall than the random classifier in each and every class.
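To make the scores discussed in the abstract concrete, the following is a minimal sketch of an unweighted Hölder (power) mean of per-class recalls. It assumes the standard definition M_p(r) = ((1/K) Σ r_k^p)^(1/p), with the geometric mean as the limit p → 0; the abstract itself does not spell out the formula. The function names `per_class_recalls` and `holder_mean` are hypothetical and chosen for illustration only.

```python
import math
from collections import Counter


def per_class_recalls(y_true, y_pred):
    """Recall of each class: correct predictions of the class / cases of the class."""
    totals = Counter(y_true)
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    return {c: hits[c] / totals[c] for c in sorted(totals)}


def holder_mean(values, p):
    """Unweighted Hölder (power) mean with exponent p (assumed standard definition)."""
    values = list(values)
    n = len(values)
    if p <= 0 and any(v == 0 for v in values):
        # For p <= 0 the mean tends to 0 as soon as one value is 0.
        return 0.0
    if p == 0:
        # Limit case p -> 0: geometric mean.
        return math.exp(sum(math.log(v) for v in values) / n)
    return (sum(v ** p for v in values) / n) ** (1.0 / p)


# Toy unbalanced problem (illustrative data, not from the paper).
y_true = [0, 0, 0, 0, 1, 1]
y_pred = [0, 0, 0, 1, 1, 0]
recalls = per_class_recalls(y_true, y_pred)

a_mean = holder_mean(recalls.values(), 1)    # arithmetic mean of the recalls
g_mean = holder_mean(recalls.values(), 0)    # geometric mean of the recalls
h_mean = holder_mean(recalls.values(), -1)   # harmonic mean of the recalls
```

With p = 1 this reduces to the arithmetic mean of the recalls (the a-mean named in the abstract). Smaller exponents penalise a low recall in any single class more heavily, so the mean never increases as p decreases.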


