Cross-validation

03/09/2017
by   Sylvain Arlot, et al.
0

This text is a survey on cross-validation. We define all classical cross-validation procedures, and we study their properties for two different goals: estimating the risk of a given estimator, and selecting the best estimator among a given family. For the risk estimation problem, we compute the bias (which can also be corrected) and the variance of cross-validation methods. For estimator selection, we first provide a first-order analysis (based on expectations). Then, we explain how to take into account second-order terms (from variance computations, and by taking into account the usefulness of overpenalization). This allows, in the end, to provide some guidelines for choosing the best cross-validation method for a given learning problem.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/10/2021

Bagging cross-validated bandwidth selection in nonparametric regression estimation with applications to large-sized samples

Cross-validation is a well-known and widely used bandwidth selection met...
research
02/28/2013

Estimating the Maximum Expected Value: An Analysis of (Nested) Cross Validation and the Maximum Sample Average

We investigate the accuracy of the two most common estimators for the ma...
research
11/14/2017

On Optimal Generalizability in Parametric Learning

We consider the parametric learning problem, where the objective of the ...
research
02/26/2018

Estimation of Local Degree Distributions via Local Weighted Averaging and Monte Carlo Cross-Validation

Owing to their capability of summarising interactions between elements o...
research
02/01/2023

Adaptive hedging horizon and hedging performance estimation

In this study, we constitute an adaptive hedging method based on empiric...
research
08/01/2019

Estimating the Standard Error of Cross-Validation-Based Estimators of Classification Rules Performance

First, we analyze the variance of the Cross Validation (CV)-based estima...
research
02/16/2018

Bayesian cross-validation of geostatistical models

The problem of validating or criticising models for georeferenced data i...

Please sign up or login with your details

Forgot password? Click here to reset