Comprehensive Stepwise Selection for Logistic Regression

06/08/2023
by   Bernd Engelmann, et al.
0

Automated variable selection is widely applied in statistical model development. Algorithms like forward, backward or stepwise selection are available in statistical software packages like R and SAS. Many researchers have criticized the use of these algorithms because the models resulting from automated selection algorithms are not based on theory and tend to be unstable. Furthermore, simulation studies have shown that they often select incorrect variables due to random effects which makes these model building strategies unreliable. In this article, a comprehensive stepwise selection algorithm tailored to logistic regression is proposed. It uses multiple criteria in variable selection instead of relying on one single measure only, like a p-value or Akaike's information criterion, which ensures robustness and soundness of the final outcome. The result of the selection process might not be unambiguous. It might select multiple models that could be considered as statistically equivalent. A simulation study demonstrates the superiority of the proposed variable selection method over available alternatives.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/16/2022

A review and recommendations on variable selection methods in regression models for binary data

The selection of essential variables in logistic regression is vital bec...
research
09/16/2021

On variable selection in joint modeling of mean and dispersion

The joint modeling of mean and dispersion (JMMD) provides an efficient m...
research
09/28/2011

Robust Parametric Classification and Variable Selection by a Minimum Distance Criterion

We investigate a robust penalized logistic regression algorithm based on...
research
11/30/2021

Efficient and robust high-dimensional sparse logistic regression via nonlinear primal-dual hybrid gradient algorithms

Logistic regression is a widely used statistical model to describe the r...
research
02/01/2018

Greedy Active Learning Algorithm for Logistic Regression Models

We study a logistic model-based active learning procedure for binary cla...
research
05/11/2018

Stochastic Approximation EM for Logistic Regression with Missing Values

Logistic regression is a common classification method in supervised lear...

Please sign up or login with your details

Forgot password? Click here to reset