Class-Weighted Evaluation Metrics for Imbalanced Data Classification

10/12/2020
by Akhilesh Gupta, et al.
Class distribution skews in imbalanced datasets may lead to models with prediction bias towards majority classes, making fair assessment of classifiers a challenging task. Balanced Accuracy is a popular metric used to evaluate a classifier's prediction performance under such scenarios. However, this metric falls short when classes vary in importance, especially when class importance is skewed differently from class cardinality distributions. In this paper, we propose a simple and general-purpose evaluation framework for imbalanced data classification that is sensitive to arbitrary skews in class cardinalities and importances. Experiments with several state-of-the-art classifiers tested on real-world datasets and benchmarks from two different domains show that our new framework is more effective than Balanced Accuracy – not only in evaluating and ranking model predictions, but also in training the models themselves.
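To make the contrast in the abstract concrete, here is a minimal sketch of Balanced Accuracy as the unweighted mean of per-class recall, next to an importance-weighted mean of the same recalls. The abstract does not give the paper's formulas, so the weighted variant is only an illustrative assumption and not the authors' framework; the function names and the `importance` mapping are hypothetical.

```python
from collections import defaultdict


def per_class_recall(y_true, y_pred):
    """Return {class: recall} for every class that appears in y_true."""
    total = defaultdict(int)
    correct = defaultdict(int)
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        if t == p:
            correct[t] += 1
    return {c: correct[c] / total[c] for c in total}


def weighted_recall_score(y_true, y_pred, importance=None):
    """Importance-weighted mean of per-class recall.

    With importance=None every class gets weight 1, which reduces to
    Balanced Accuracy. The `importance` dict (class -> non-negative
    weight) is a hypothetical input, not taken from the paper.
    """
    recalls = per_class_recall(y_true, y_pred)
    if importance is None:
        importance = {c: 1.0 for c in recalls}
    weight_sum = sum(importance[c] for c in recalls)
    return sum(importance[c] * recalls[c] for c in recalls) / weight_sum


# Toy data: 8 majority-class samples ("a") and 2 minority-class samples ("b").
y_true = ["a"] * 8 + ["b"] * 2
y_pred = ["a"] * 8 + ["a", "b"]  # one of the two "b" samples is missed

plain_accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
balanced = weighted_recall_score(y_true, y_pred)
importance_weighted = weighted_recall_score(y_true, y_pred, {"a": 1.0, "b": 3.0})
```

On this toy data plain accuracy rewards the majority class, Balanced Accuracy treats both classes equally, and the weighted variant penalises the missed minority sample further because "b" was assigned a higher importance.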

research
10/20/2021

ABC: Auxiliary Balanced Classifier for Class-imbalanced Semi-supervised Learning

Existing semi-supervised learning (SSL) algorithms typically assume clas...
research
10/09/2020

Measuring What Counts: The case of Rumour Stance Classification

Stance classification can be a powerful tool for understanding whether a...
research
03/27/2023

Evaluating XGBoost for Balanced and Imbalanced Data: Application to Fraud Detection

This paper evaluates XGboost's performance given different dataset sizes...
research
10/16/2018

An empirical evaluation of imbalanced data strategies from a practitioner's point of view

This research tested the following well known strategies to deal with bi...
research
06/05/2022

Never mind the metrics – what about the uncertainty? Visualising confusion matrix metric distributions

There are strong incentives to build models that demonstrate outstanding...
research
11/02/2017

Oversampling for Imbalanced Learning Based on K-Means and SMOTE

Learning from class-imbalanced data continues to be a common and challen...
