A surrogate loss function for optimization of F_β score in binary classification with imbalanced data

04/03/2021
by   Namgil Lee, et al.
0

The F_β score is a commonly used measure of classification performance, which plays crucial roles in classification tasks with imbalanced data sets. However, the F_β score cannot be used as a loss function by gradient-based learning algorithms for optimizing neural network parameters due to its non-differentiability. On the other hand, commonly used loss functions such as the binary cross-entropy (BCE) loss are not directly related to performance measures such as the F_β score, so that neural networks optimized by using the loss functions may not yield optimal performance measures. In this study, we investigate a relationship between classification performance measures and loss functions in terms of the gradients with respect to the model parameters. Then, we propose a differentiable surrogate loss function for the optimization of the F_β score. We show that the gradient paths of the proposed surrogate F_β loss function approximate the gradient paths of the large sample limit of the F_β score. Through numerical experiments using ResNets and benchmark image data sets, it is demonstrated that the proposed surrogate F_β loss function is effective for optimizing F_β scores under class imbalances in binary classification tasks compared with other loss functions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/31/2022

Xtreme Margin: A Tunable Loss Function for Binary Classification Problems

Loss functions drive the optimization of machine learning algorithms. Th...
research
08/24/2021

sigmoidF1: A Smooth F1 Score Surrogate Loss for Multilabel Classification

Multiclass multilabel classification refers to the task of attributing m...
research
06/20/2022

Model Optimization in Imbalanced Regression

Imbalanced domain learning aims to produce accurate models in predicting...
research
05/29/2019

Calibrated Surrogate Maximization of Linear-fractional Utility in Binary Classification

Complex classification performance metrics such as the F_β-measure and J...
research
12/13/2017

Differentiable lower bound for expected BLEU score

In natural language processing tasks performance of the models is often ...
research
07/22/2022

Classification via score-based generative modelling

In this work, we investigated the application of score-based gradient le...
research
10/29/2022

Reformulating van Rijsbergen's F_β metric for weighted binary cross-entropy

The separation of performance metrics from gradient based loss functions...

Please sign up or login with your details

Forgot password? Click here to reset