Re-evaluation of the comparative effectiveness of bootstrap-based optimism correction methods in the development of multivariable clinical prediction models

03/06/2020
by   Katsuhiro Iba, et al.
0

Multivariable predictive models are important statistical tools for providing synthetic diagnosis and prognostic algorithms based on multiple patients' characteristics. Their apparent discriminant and calibration measures usually have overestimation biases (known as 'optimism') relative to the actual performances for external populations. Existing statistical evidence and guidelines suggest that three bootstrap-based bias correction methods are preferable in practice, namely Harrell's bias correction and the .632 and .632+ estimators. Although Harrell's method has been widely adopted in clinical studies, simulation-based evidence indicates that the .632+ estimator may perform better than the other two methods. However, there is limited evidence and these methods' actual comparative effectiveness is still unclear. In this article, we conducted extensive simulations to compare the effectiveness of these methods, particularly using the following modern regression models: conventional logistic regression, stepwise variable selections, Firth's penalized likelihood method, ridge, lasso, and elastic-net. Under relatively large sample settings, the three bootstrap-based methods were comparable and performed well. However, all three methods had biases under small sample settings, and the directions and sizes of the biases were inconsistent. In general, the .632+ estimator is recommended, but we provide several notes concerning the operating characteristics of each method.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/26/2019

On the variability of regression shrinkage methods for clinical prediction models: simulation study on predictive performance

When developing risk prediction models, shrinkage methods are recommende...
research
02/19/2020

Asymptotically Optimal Bias Reduction for Parametric Models

An important challenge in statistical analysis concerns the control of t...
research
02/18/2022

The harm of class imbalance corrections for risk prediction models: illustration and simulation using logistic regression

Methods to correct class imbalance, i.e. imbalance between the frequency...
research
01/19/2021

On resampling methods for model assessment in penalized and unpenalized logistic regression

Penalized logistic regression methods are frequently used to investigate...
research
08/18/2020

Estimation of causal effects of multiple treatments in healthcare database studies with rare outcomes

The preponderance of large-scale healthcare databases provide abundant o...
research
01/03/2020

Judicial Favoritism of Politicians: Evidence from Small Claims Court

Multiple studies have documented racial, gender, political ideology, or ...

Please sign up or login with your details

Forgot password? Click here to reset