Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation

10/11/2020
by David M. W. Powers, et al.

Commonly used evaluation measures, including Recall, Precision, F-Measure and Rand Accuracy, are biased and should not be used without a clear understanding of those biases and a corresponding identification of the chance or base case levels of the statistic. A system that performs worse in the objective sense of Informedness can appear to perform better under any of these commonly used measures. We discuss several concepts and measures that reflect the probability that a prediction is informed versus chance, namely Informedness, and introduce Markedness as a dual measure for the probability that a prediction is marked versus chance. Finally, we demonstrate elegant connections between the concepts of Informedness, Markedness, Correlation and Significance, as well as their intuitive relationships with Recall and Precision, and outline the extension from the dichotomous case to the general multi-class case.
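
The abstract does not spell out the formulas, so the following is only a minimal sketch under the usual dichotomous definitions: Informedness = Recall + Inverse Recall - 1, Markedness = Precision + Inverse Precision - 1, and the Matthews correlation computed from the four cells of the contingency table. The `Confusion` class and the two systems' counts are hypothetical, chosen purely to illustrate how a chance-level system can score higher on Recall, F-Measure and Rand Accuracy than an informed one.

```python
from dataclasses import dataclass
from math import sqrt


@dataclass(frozen=True)
class Confusion:
    """Hypothetical container for a 2x2 contingency table (illustrative only)."""
    tp: int  # real positives predicted positive
    fn: int  # real positives predicted negative
    fp: int  # real negatives predicted positive
    tn: int  # real negatives predicted negative

    def recall(self) -> float:
        return self.tp / (self.tp + self.fn)

    def inverse_recall(self) -> float:
        return self.tn / (self.tn + self.fp)

    def precision(self) -> float:
        return self.tp / (self.tp + self.fp)

    def inverse_precision(self) -> float:
        return self.tn / (self.tn + self.fn)

    def accuracy(self) -> float:
        return (self.tp + self.tn) / (self.tp + self.fn + self.fp + self.tn)

    def f1(self) -> float:
        p, r = self.precision(), self.recall()
        return 2 * p * r / (p + r)

    def informedness(self) -> float:
        # Recall + Inverse Recall - 1; zero for a chance-level predictor.
        return self.recall() + self.inverse_recall() - 1

    def markedness(self) -> float:
        # Precision + Inverse Precision - 1; the dual of Informedness.
        return self.precision() + self.inverse_precision() - 1

    def correlation(self) -> float:
        # Matthews correlation computed directly from the four cells.
        num = self.tp * self.tn - self.fp * self.fn
        den = sqrt((self.tp + self.fp) * (self.tp + self.fn)
                   * (self.tn + self.fp) * (self.tn + self.fn))
        return num / den


# Made-up data: 90 real positives and 10 real negatives.
# System A labels 90% of each class positive, i.e. it ignores the class.
system_a = Confusion(tp=81, fn=9, fp=9, tn=1)
# System B finds 70% of the positives and 90% of the negatives.
system_b = Confusion(tp=63, fn=27, fp=1, tn=9)

for name, c in (("A", system_a), ("B", system_b)):
    print(f"{name}: recall={c.recall():.3f} F1={c.f1():.3f} "
          f"accuracy={c.accuracy():.3f} informedness={c.informedness():.3f} "
          f"markedness={c.markedness():.3f} correlation={c.correlation():.3f}")
```

On these made-up counts, system A scores higher on Recall, F-Measure and Rand Accuracy even though its Informedness is zero, while system B has an Informedness of 0.6. For system B the correlation also equals the geometric mean of Informedness and Markedness, which is one of the connections the abstract refers to. The sketch does not guard against empty rows or columns in the table, where some of these ratios are undefined.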

research
07/03/2020

The Effect of Class Imbalance on Precision-Recall Curves

In this note I study how the precision of a classifier depends on the ra...
research
04/03/2015

Evaluation Evaluation a Monte Carlo study

Over the last decade there has been increasing concern about the biases ...
research
04/09/2018

A plug-in approach to maximising precision at the top and recall at the top

For information retrieval and binary classification, we show that precis...
research
08/17/2010

A unifying view for performance measures in multi-class prediction

In the last few years, many different performance measures have been int...
research
07/31/2020

F*: An Interpretable Transformation of the F-measure

The F-measure is widely used to assess the performance of classification...
research
08/13/2020

Statistical Evaluation of Anomaly Detectors for Sequences

Although precision and recall are standard performance measures for anom...
research
03/19/2023

Two Kinds of Recall

It is an established assumption that pattern-based models are good at pr...
