Good Classification Measures and How to Find Them

01/22/2022
by   Martijn Gösgens, et al.
0

Several performance measures can be used for evaluating classification results: accuracy, F-measure, and many others. Can we say that some of them are better than others, or, ideally, choose one measure that is best in all situations? To answer this question, we conduct a systematic analysis of classification performance measures: we formally define a list of desirable properties and theoretically analyze which measures satisfy which properties. We also prove an impossibility theorem: some desirable properties cannot be simultaneously satisfied. Finally, we propose a new family of measures satisfying all desirable properties except one. This family includes the Matthews Correlation Coefficient and a so-called Symmetric Balanced Accuracy that was not previously used in classification literature. We believe that our systematic approach gives an important tool to practitioners for adequately evaluating classification results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/13/2022

Characterizing Graph Datasets for Node Classification: Beyond Homophily-Heterophily Dichotomy

Homophily is a graph property describing the tendency of edges to connec...
research
12/14/2017

Rate of Change Analysis for Interestingness Measures

The use of Association Rule Mining techniques in diverse contexts and do...
research
08/07/2017

A Characterization of Monotone Influence Measures for Data Classification

In this work we focus on the following question: how important was the i...
research
11/12/2019

Systematic Analysis of Cluster Similarity Indices: Towards Bias-free Cluster Validation

There are many cluster similarity indices used to evaluate clustering al...
research
04/24/2017

Visual-Based Analysis of Classification Measures with Applications to Imbalanced Data

With a plethora of available classification performance measures, choosi...
research
11/10/2022

A classification performance evaluation measure considering data separability

Machine learning and deep learning classification models are data-driven...
research
08/13/2021

Stochastic orders and measures of skewness and dispersion based on expectiles

Recently, expectile-based measures of skewness have been introduced whic...

Please sign up or login with your details

Forgot password? Click here to reset