Test Error Estimation after Model Selection Using Validation Error

01/09/2018
by   Leying Guan, et al.
0

When performing supervised learning with the model selected using validation error from sample splitting and cross validation, the minimum value of the validation error can be biased downward. We propose two simple methods that use the errors produced in the validating step to estimate the test error after model selection, and we focus on the situations where we select the model by minimizing the validation error and the randomized validation error. Our methods do not require model refitting, and the additional computational cost is negligible. In the setting of sample splitting, we show that, the proposed test error estimates have biases of size o(1/√(n)) under suitable assumptions. We also propose to use the bootstrap to construct confidence intervals for the test error based on this result. We apply our proposed methods to a number of simulations and examine their performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/26/2022

Confidence intervals for the Cox model test error from cross-validation

Cross-validation (CV) is one of the most widely used techniques in stati...
research
07/01/2023

Bootstrapping the Cross-Validation Estimate

Cross-validation is a widely used technique for evaluating the performan...
research
02/10/2023

The out-of-sample R^2: estimation and inference

Out-of-sample prediction is the acid test of predictive models, yet an i...
research
10/24/2022

Post-Selection Confidence Bounds for Prediction Performance

In machine learning, the selection of a promising model from a potential...
research
01/20/2022

Nonnested model selection based on empirical likelihood

We propose an empirical likelihood ratio test for nonparametric model se...
research
01/06/2021

Cross-Validation and Uncertainty Determination for Randomized Neural Networks with Applications to Mobile Sensors

Randomized artificial neural networks such as extreme learning machines ...
research
09/07/2023

Efficient estimation and correction of selection-induced bias with order statistics

Model selection aims to identify a sufficiently well performing model th...

Please sign up or login with your details

Forgot password? Click here to reset