Pearson-Matthews correlation coefficients for binary and multinary classification and hypothesis testing

05/10/2023
by Petre Stoica, et al.

The Pearson-Matthews correlation coefficient (usually abbreviated MCC) is considered to be one of the most useful metrics for the performance of a binary classification or hypothesis testing method. For conciseness we use the classification terminology throughout, but the concepts and methods discussed in the paper apply verbatim to hypothesis testing as well. For multinary classification tasks (with more than two classes), the existing extension of MCC, commonly called the R_K metric, has also been used successfully in many applications. The present paper begins with an introductory discussion of certain aspects of MCC. We then turn to multinary classification, which is the main focus of this paper and which, despite its practical and theoretical importance, appears to be less developed than binary classification. Our discussion of R_K is followed by the introduction of two other metrics for multinary classification derived from the multivariate Pearson correlation (MPC) coefficients. We show that both R_K and the MPC metrics fail to indicate poor classification results decisively when they should, and we introduce three new enhanced metrics that do not suffer from this problem. We also present an additional new metric for multinary classification which can be viewed as a direct extension of MCC.
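For readers who want a concrete handle on the two existing metrics named above, the following is a minimal sketch in Python of the standard textbook formulas for MCC (from the four cells of a binary confusion matrix) and for R_K (from a K-by-K confusion matrix). It is not taken from the paper: the function names `mcc` and `r_k`, the convention that rows are true classes and columns are predicted classes, and the choice to return 0.0 when the denominator vanishes are all assumptions made for illustration. The paper's new enhanced metrics are not reproduced here.

```python
import math


def mcc(tp, fn, fp, tn):
    """Binary MCC from confusion-matrix counts (standard formula)."""
    num = tp * tn - fp * fn
    den = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Assumption: report 0.0 when a marginal is empty and MCC is undefined.
    return num / den if den else 0.0


def r_k(conf):
    """R_K from a K x K confusion matrix.

    Assumption: conf[i][j] counts samples of true class i predicted as j.
    """
    k = len(conf)
    s = sum(sum(row) for row in conf)          # total number of samples
    c = sum(conf[i][i] for i in range(k))      # correctly classified samples
    t = [sum(conf[i]) for i in range(k)]       # true count per class
    p = [sum(conf[i][j] for i in range(k)) for j in range(k)]  # predicted count per class
    num = c * s - sum(pk * tk for pk, tk in zip(p, t))
    den = math.sqrt(
        (s * s - sum(pk * pk for pk in p)) * (s * s - sum(tk * tk for tk in t))
    )
    # Assumption: same 0.0 convention as in mcc() for a degenerate matrix.
    return num / den if den else 0.0


if __name__ == "__main__":
    # With two classes, R_K and MCC agree on the same confusion matrix.
    print(mcc(tp=6, fn=2, fp=1, tn=3))
    print(r_k([[6, 2], [1, 3]]))
```

With K = 2 the two functions return the same value for the same confusion matrix, which is the sense in which R_K extends MCC to more than two classes.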


