Futility Analysis in the Cross-Validation of Machine Learning Models

05/27/2014
by   Max Kuhn, et al.
0

Many machine learning models have important structural tuning parameters that cannot be directly estimated from the data. The common tactic for setting these parameters is to use resampling methods, such as cross--validation or the bootstrap, to evaluate a candidate set of values and choose the best based on some pre--defined criterion. Unfortunately, this process can be time consuming. However, the model tuning process can be streamlined by adaptively resampling candidate values so that settings that are clearly sub-optimal can be discarded. The notion of futility analysis is introduced in this context. An example is shown that illustrates how adaptive resampling can be used to reduce training time. Simulation studies are used to understand how the potential speed--up is affected by parallel processing techniques.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/19/2018

Using J-K fold Cross Validation to Reduce Variance When Tuning NLP Models

K-fold cross validation (CV) is a popular method for estimating the true...
research
03/03/2021

Machine Learning using Stata/Python

We present two related Stata modules, r_ml_stata and c_ml_stata, for fit...
research
09/05/2019

On the discriminative power of Hyper-parameters in Cross-Validation and how to choose them

Hyper-parameters tuning is a crucial task to make a model perform at its...
research
05/03/2021

Model Averaging by Cross-validation for Partially Linear Functional Additive Models

We consider averaging a number of candidate models to produce a predicti...
research
03/18/2020

Bootstrap Bias Corrected Cross Validation applied to Super Learning

Super learner algorithm can be applied to combine results of multiple ba...
research
11/26/2019

The Early Roots of Statistical Learning in the Psychometric Literature: A review and two new results

Machine and Statistical learning techniques become more and more importa...
research
06/06/2017

Shape Parameter Estimation

Performance of machine learning approaches depends strongly on the choic...

Please sign up or login with your details

Forgot password? Click here to reset