Comparing methods addressing multi-collinearity when developing prediction models

01/05/2021
by Artuur M. Leeuwenberg, et al.

Clinical prediction models are developed widely across medical disciplines. When predictors in such models are highly collinear, unexpected or spurious predictor-outcome associations may occur, thereby potentially reducing face-validity and explainability of the prediction model. Collinearity can be dealt with by exclusion of collinear predictors, but when there is no a priori motivation (besides collinearity) to include or exclude specific predictors, such an approach is arbitrary and possibly inappropriate. We compare different methods to address collinearity, including shrinkage, dimensionality reduction, and constrained optimization. The effectiveness of these methods is illustrated via simulations. In the conducted simulations, no effect of collinearity was observed on predictive outcomes. However, a negative effect of collinearity on the stability of predictor selection was found, affecting all compared methods, but in particular methods that perform strong predictor selection (e.g., Lasso).
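
As an illustration of why collinearity can produce unstable predictor-outcome associations, and how shrinkage may dampen that instability, the following is a minimal sketch and not the authors' simulation setup. It assumes a hypothetical two-predictor linear model (y = x1 + x2 + noise) and a hand-rolled ridge fit without intercept. The names `simulate`, `fit_ridge`, and `coef_sd`, and all parameter values, are illustrative assumptions.

```python
import random
import statistics


def simulate(n, rho, rng):
    """Draw n rows (x1, x2, y) with corr(x1, x2) close to rho.

    Assumed data-generating model (illustrative only): y = x1 + x2 + noise.
    """
    rows = []
    for _ in range(n):
        z1, z2 = rng.gauss(0, 1), rng.gauss(0, 1)
        x1 = z1
        x2 = rho * z1 + (1 - rho ** 2) ** 0.5 * z2
        y = x1 + x2 + rng.gauss(0, 1)
        rows.append((x1, x2, y))
    return rows


def fit_ridge(rows, lam):
    """Solve (X'X + lam * I) b = X'y for two predictors, no intercept.

    lam = 0 gives ordinary least squares; lam > 0 adds ridge shrinkage.
    """
    s11 = sum(x1 * x1 for x1, _, _ in rows) + lam
    s22 = sum(x2 * x2 for _, x2, _ in rows) + lam
    s12 = sum(x1 * x2 for x1, x2, _ in rows)
    t1 = sum(x1 * y for x1, _, y in rows)
    t2 = sum(x2 * y for _, x2, y in rows)
    det = s11 * s22 - s12 * s12
    b1 = (s22 * t1 - s12 * t2) / det
    b2 = (s11 * t2 - s12 * t1) / det
    return b1, b2


def coef_sd(rho, lam, n=100, reps=200, seed=0):
    """Standard deviation of the estimated x1 coefficient across simulated datasets."""
    rng = random.Random(seed)
    b1_estimates = [fit_ridge(simulate(n, rho, rng), lam)[0] for _ in range(reps)]
    return statistics.stdev(b1_estimates)


if __name__ == "__main__":
    print("OLS,   rho=0.00:", round(coef_sd(0.0, 0.0), 3))
    print("OLS,   rho=0.95:", round(coef_sd(0.95, 0.0), 3))
    print("Ridge, rho=0.95:", round(coef_sd(0.95, 10.0), 3))
```

Under these assumptions, the spread of the estimated coefficient should be larger with highly correlated predictors than with uncorrelated ones, and the ridge penalty should reduce it again. The sketch looks only at coefficient variability; it does not reproduce the paper's comparison of predictive performance or of predictor selection stability.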
