Optimal Decision-Theoretic Classification Using Non-Decomposable Performance Metrics

05/07/2015
by   Nagarajan Natarajan, et al.
0

We provide a general theoretical analysis of expected out-of-sample utility, also referred to as decision-theoretic classification, for non-decomposable binary classification metrics such as F-measure and Jaccard coefficient. Our key result is that the expected out-of-sample utility for many performance metrics is provably optimized by a classifier which is equivalent to a signed thresholding of the conditional probability of the positive class. Our analysis bridges a gap in the literature on binary classification, revealed in light of recent results for non-decomposable metrics in population utility maximization style classification. Our results identify checkable properties of a performance metric which are sufficient to guarantee a probability ranking principle. We propose consistent estimators for optimal expected out-of-sample classification. As a consequence of the probability ranking principle, computational requirements can be reduced from exponential to cubic complexity in the general case, and further reduced to quadratic complexity in special cases. We provide empirical results on simulated and benchmark datasets evaluating the performance of the proposed algorithms for decision-theoretic classification and comparing them to baseline and state-of-the-art methods in population utility maximization for non-decomposable metrics.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/23/2016

Online Classification with Complex Metrics

We present a framework and analysis of consistent binary classification ...
research
02/21/2023

Does the evaluation stand up to evaluation? A first-principle approach to the evaluation of classifiers

How can one meaningfully make a measurement, if the meter does not confo...
research
04/09/2018

A plug-in approach to maximising precision at the top and recall at the top

For information retrieval and binary classification, we show that precis...
research
05/29/2019

Calibrated Surrogate Maximization of Linear-fractional Utility in Binary Classification

Complex classification performance metrics such as the F_β-measure and J...
research
02/21/2023

Don't guess what's true: choose what's optimal. A probability transducer for machine-learning classifiers

In fields such as medicine and drug discovery, the ultimate goal of a cl...
research
08/24/2019

Consistent Classification with Generalized Metrics

We propose a framework for constructing and analyzing multiclass and mul...
research
07/19/2022

Selecting applicants based on multiple ratings: Using binary classification framework as an alternative to inter-rater reliability

Inter-rater reliability (IRR) has been the prevalent quality and precisi...

Please sign up or login with your details

Forgot password? Click here to reset