Effects of Influential Points and Sample Size on the Selection and Replicability of Multivariable Fractional Polynomial Models

09/20/2022
by   Willi Sauerbrei, et al.
0

The multivariable fractional polynomial (MFP) procedure combines variable selection with a function selection procedure (FSP). For continuous variables, a closed test procedure is used to decide between no effect, linear, FP1 or FP2 functions. Influential observations (IPs) and small sample size can both have an impact on a selected fractional polynomial model. In this paper, we used simulated data with six continuous and four categorical predictors to illustrate approaches which can help to identify IPs with an influence on function selection and the MFP model. Approaches use leave-one or two-out and two related techniques for a multivariable assessment. In seven subsamples we also investigated the effects of sample size and model replicability. For better illustration, a structured profile was used to provide an overview of all analyses conducted. The results showed that one or more IPs can drive the functions and models selected. In addition, with a small sample size, MFP might not be able to detect non-linear functions and the selected model might differ substantially from the true underlying model. However, if the sample size is sufficient and regression diagnostics are carefully conducted, MFP can be a suitable approach to select variables and functional forms for continuous variables.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/30/2021

Bayesian Information Criterion for Linear Mixed-effects Models

The use of Bayesian information criterion (BIC) in the model selection p...
research
07/01/2019

State-of-the-art in selection of variables and functional forms in multivariable analysis -- outstanding issues

How to select variables and identify functional forms for continuous var...
research
04/16/2020

Combining heterogeneous subgroups with graph-structured variable selection priors for Cox regression

Important objectives in cancer research are the prediction of a patient'...
research
05/25/2021

Kernel Knockoffs Selection for Nonparametric Additive Models

Thanks to its fine balance between model flexibility and interpretabilit...
research
07/13/2022

Fitting Semiparametric Cumulative Probability Models for Big Data

Cumulative probability models (CPMs) are a robust alternative to linear ...
research
02/23/2022

Investigating the effect of binning on causal discovery

Binning (a.k.a. discretization) of numerically continuous measurements i...

Please sign up or login with your details

Forgot password? Click here to reset