State-of-the-art in selection of variables and functional forms in multivariable analysis -- outstanding issues

by   Willi Sauerbrei, et al.

How to select variables and identify functional forms for continuous variables is a key concern when creating a multivariable model. Ad hoc 'traditional' approaches to variable selection have been in use for at least 50 years. Similarly, methods for determining functional forms for continuous variables were first suggested many years ago. More recently, many alternative approaches to address these two challenges have been proposed, but knowledge of their properties and meaningful comparisons between them are scarce. To define a state-of-the-art and to provide evidence-supported guidance to researchers who have only a basic level of statistical knowledge many outstanding issues in multivariable modelling remain. Our main aims are to identify and illustrate such gaps in the literature and present them at a moderate technical level to the wide community of practitioners, researchers and students of statistics. We briefly discuss general issues in building descriptive regression models, strategies for variable selection, different ways of choosing functional forms for continuous variables, and methods for combining the selection of variables and functions. We discuss two examples, taken from the medical literature, to illustrate problems in the practice of modelling. Our overview revealed that there is not yet enough evidence on which to base recommendations for the selection of variables and functional forms in multivariable analysis. Such evidence may come from comparisons between alternative methods. In particular, we highlight seven important topics that require further investigation and make suggestions for the direction of further research.



There are no comments yet.


page 1

page 2

page 3

page 4


A review and recommendations on variable selection methods in regression models for binary data

The selection of essential variables in logistic regression is vital bec...

Nonlinear variable selection with continuous outcome: a nonparametric incremental forward stagewise approach

We present a method of variable selection for the situation where some p...

An update on statistical boosting in biomedicine

Statistical boosting algorithms have triggered a lot of research during ...

Variable selection in Functional Additive Regression Models

This paper considers the problem of variable selection when some of the ...

Variable selection with multiply-imputed datasets: choosing between stacked and grouped methods

Penalized regression methods, such as lasso and elastic net, are used in...

The Knowledge Graph for Macroeconomic Analysis with Alternative Big Data

The current knowledge system of macroeconomics is built on interactions ...

Inference Networks and the Evaluation of Evidence: Alternative Analyses

Inference networks have a variety of important uses and are constructed ...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.