Unachievable Region in Precision-Recall Space and Its Effect on Empirical Evaluation

06/18/2012
by   Kendrick Boyd, et al.
0

Precision-recall (PR) curves and the areas under them are widely used to summarize machine learning results, especially for data sets exhibiting class skew. They are often used analogously to ROC curves and the area under ROC curves. It is known that PR curves vary as class skew changes. What was not recognized before this paper is that there is a region of PR space that is completely unachievable, and the size of this region depends only on the skew. This paper precisely characterizes the size of that region and discusses its implications for empirical evaluation methodology in machine learning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/16/2023

Precision and Recall Reject Curves for Classification

For some classification scenarios, it is desirable to use only those cla...
research
07/03/2020

The Effect of Class Imbalance on Precision-Recall Curves

In this note I study how the precision of a classifier depends on the ra...
research
05/14/2019

Revisiting Precision and Recall Definition for Generative Model Evaluation

In this article we revisit the definition of Precision-Recall (PR) curve...
research
04/04/2023

Clustering Validation with The Area Under Precision-Recall Curves

Confusion matrices and derived metrics provide a comprehensive framework...
research
10/19/2018

Population and Empirical PR Curves for Assessment of Ranking Algorithms

The ROC curve is widely used to assess the quality of prediction/classif...
research
06/27/2023

An Empirical Evaluation of the Rashomon Effect in Explainable Machine Learning

The Rashomon Effect describes the following phenomenon: for a given data...

Please sign up or login with your details

Forgot password? Click here to reset