Consistency and Finite Sample Behavior of Binary Class Probability Estimation

08/30/2019
by   Alexander Mey, et al.
6

In this work we investigate to which extent one can recover class probabilities within the empirical risk minimization (ERM) paradigm. The main aim of our paper is to extend existing results and emphasize the tight relations between empirical risk minimization and class probability estimation. Based on existing literature on excess risk bounds and proper scoring rules, we derive a class probability estimator based on empirical risk minimization. We then derive fairly general conditions under which this estimator will converge, in the L1-norm and in probability, to the true class probabilities. Our main contribution is to present a way to derive finite sample L1-convergence rates of this estimator for different surrogate loss functions. We also study in detail which commonly used loss functions are suitable for this estimation problem and finally discuss the setting of model-misspecification as well as a possible extension to asymmetric loss functions.

READ FULL TEXT

page 2

page 3

page 4

page 5

page 6

page 10

page 11

page 12

research
12/12/2013

Oracle Inequalities for Convex Loss Functions with Non-Linear Targets

This paper consider penalized empirical loss minimization of convex loss...
research
03/04/2015

Class Probability Estimation via Differential Geometric Regularization

We study the problem of supervised learning for both binary and multicla...
research
06/15/2015

Convex Risk Minimization and Conditional Probability Estimation

This paper proves, in very general settings, that convex risk minimizati...
research
10/09/2015

Conditional Risk Minimization for Stochastic Processes

We study the task of learning from non-i.i.d. data. In particular, we ai...
research
02/19/2019

Proper-Composite Loss Functions in Arbitrary Dimensions

The study of a machine learning problem is in many ways is difficult to ...
research
03/27/2023

On the Connection between L_p and Risk Consistency and its Implications on Regularized Kernel Methods

As a predictor's quality is often assessed by means of its risk, it is n...
research
03/12/2020

Benign overfitting in the large deviation regime

We investigate the benign overfitting phenomenon in the large deviation ...

Please sign up or login with your details

Forgot password? Click here to reset