A new analytical approach to consistency and overfitting in regularized empirical risk minimization

07/01/2016
by   Nicolas Garcia Trillos, et al.
0

This work considers the problem of binary classification: given training data x_1, ..., x_n from a certain population, together with associated labels y_1,..., y_n ∈{0,1 }, determine the best label for an element x not among the training data. More specifically, this work considers a variant of the regularized empirical risk functional which is defined intrinsically to the observed data and does not depend on the underlying population. Tools from modern analysis are used to obtain a concise proof of asymptotic consistency as regularization parameters are taken to zero at rates related to the size of the sample. These analytical tools give a new framework for understanding overfitting and underfitting, and rigorously connect the notion of overfitting with a loss of compactness.

READ FULL TEXT
research
05/09/2023

Testing for Overfitting

High complexity models are notorious in machine learning for overfitting...
research
03/02/2017

Positive-Unlabeled Learning with Non-Negative Risk Estimator

From only positive (P) and unlabeled (U) data, a binary classifier could...
research
09/07/2009

Lower Bounds for BMRM and Faster Rates for Training SVMs

Regularized risk minimization with the binary hinge loss and its variant...
research
05/24/2019

Perturbed Model Validation: A New Framework to Validate Model Relevance

This paper introduces PMV (Perturbed Model Validation), a new technique ...
research
04/14/2019

Analysis of overfitting in the regularized Cox model

The Cox proportional hazards model is ubiquitous in the analysis of time...
research
06/09/2019

Understanding overfitting peaks in generalization error: Analytical risk curves for l_2 and l_1 penalized interpolation

Traditionally in regression one minimizes the number of fitting paramete...
research
11/21/2021

Deep Image Prior using Stein's Unbiased Risk Estimator: SURE-DIP

Deep learning algorithms that rely on extensive training data are revolu...

Please sign up or login with your details

Forgot password? Click here to reset