Stability of clinical prediction models developed using statistical or machine learning methods

11/02/2022
by   Richard D. Riley, et al.
Clinical prediction models estimate an individual's risk of a particular health outcome, conditional on their values of multiple predictors. A developed model is a consequence of the development dataset and the chosen model building strategy, including the sample size, number of predictors and analysis method (e.g., regression or machine learning). Here, we raise the concern that many models are developed using small datasets that lead to instability in the model and its predictions (estimated risks). We define four levels of model stability in estimated risks, moving from the overall mean to the individual level. Then, through simulation and case studies of statistical and machine learning approaches, we show that instability in a model's estimated risks is often considerable, and ultimately manifests itself as miscalibration of predictions in new data. Therefore, we recommend that researchers always examine instability at the model development stage, and we propose instability plots and measures to do so. This entails repeating the model building steps (those used in the development of the original prediction model) in each of multiple (e.g., 1000) bootstrap samples, to produce multiple bootstrap models, and then deriving (i) a prediction instability plot of bootstrap model predictions (y-axis) versus original model predictions (x-axis); (ii) a calibration instability plot showing calibration curves for the bootstrap models in the original sample; and (iii) the instability index, which is the mean absolute difference between individuals' original and bootstrap model predictions. A case study is used to illustrate how these instability assessments help indicate whether or not model predictions are likely to be reliable, whilst also informing a model's critical appraisal (risk of bias rating), fairness assessment and further validation requirements.
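
The bootstrap procedure described above can be sketched in a few lines. The following is a minimal illustration, not the authors' implementation: the helper names (`fit_logistic`, `predict_risk`, `instability`), the simulated data, the tiny ridge term added for numerical stability, and the choice of 100 bootstrap samples (rather than e.g. 1000) are all assumptions made for the example. It uses a plain logistic regression as the "model building strategy"; in practice every step of the original strategy (variable selection, tuning, and so on) would be repeated inside the loop.

```python
import numpy as np

def fit_logistic(X, y, n_iter=25, ridge=1e-6):
    """Fit a logistic regression by Newton-Raphson (intercept first).
    The tiny ridge term is an assumption, added only for numerical stability."""
    Xd = np.column_stack([np.ones(len(X)), X])
    beta = np.zeros(Xd.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-np.clip(Xd @ beta, -30, 30)))
        w = p * (1.0 - p)
        grad = Xd.T @ (y - p) - ridge * beta
        hess = (Xd * w[:, None]).T @ Xd + ridge * np.eye(Xd.shape[1])
        beta = beta + np.linalg.solve(hess, grad)
    return beta

def predict_risk(beta, X):
    """Estimated risks for the rows of X under coefficients beta."""
    Xd = np.column_stack([np.ones(len(X)), X])
    return 1.0 / (1.0 + np.exp(-np.clip(Xd @ beta, -30, 30)))

def instability(X, y, n_boot=100, seed=0):
    """Refit the model in each bootstrap sample and compare its predictions
    for the ORIGINAL individuals with the original model's predictions."""
    rng = np.random.default_rng(seed)
    n = len(y)
    original = predict_risk(fit_logistic(X, y), X)
    boot = np.empty((n_boot, n))
    for b in range(n_boot):
        idx = rng.integers(0, n, size=n)           # sample rows with replacement
        beta_b = fit_logistic(X[idx], y[idx])      # repeat the model building steps
        boot[b] = predict_risk(beta_b, X)          # predict in the original sample
    # Mean absolute difference per individual, then averaged over individuals.
    per_individual = np.abs(boot - original).mean(axis=0)
    return original, boot, per_individual.mean()

# Hypothetical simulated data: three predictors, binary outcome.
rng = np.random.default_rng(1)

def simulate(n):
    X = rng.normal(size=(n, 3))
    lp = -1.0 + X @ np.array([0.5, -0.5, 0.3])
    y = rng.binomial(1, 1.0 / (1.0 + np.exp(-lp))).astype(float)
    return X, y

X_small, y_small = simulate(100)
X_large, y_large = simulate(2000)
orig_small, boot_small, index_small = instability(X_small, y_small)
orig_large, boot_large, index_large = instability(X_large, y_large)
print("instability index, n=100: ", index_small)
print("instability index, n=2000:", index_large)
```

Plotting each row of `boot_small` against `orig_small` would give a prediction instability plot in the sense described above; the same matrix of bootstrap predictions is also the input needed for calibration curves of the bootstrap models in the original sample.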
