Logistic regression with missing responses and predictors: a review of existing approaches and a case study

02/07/2023
by   Susana Rafaela Martins, et al.
0

In this work logistic regression when both the response and the predictor variables may be missing is considered. Several existing approaches are reviewed, including complete case analysis, inverse probability weighting, multiple imputation and maximum likelihood. The methods are compared in a simulation study, which serves to evaluate the bias, the variance and the mean squared error of the estimators for the regression coefficients. In the simulations, the maximum likelihood methodology is the one that presents the best results, followed by multiple imputation with five imputations, which is the second best. The methods are applied to a case study on the obesity for schoolchildren in the municipality of Viana do Castelo, North Portugal, where a logistic regression model is used to predict the International Obesity Task Force (IOTF) indicator from physical examinations and the past values of the obesity status. All the variables in the case study are potentially missing, with gender as the only exception. The results provided by the several methods are in well agreement, indicating the relevance of the past values of IOTF and physical scores for the prediction of obesity. Practical recommendations are given.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/05/2022

Liu-type Shrinkage Estimators for Mixture of Logistic Regressions: An Osteoporosis Study

The logistic regression model is one of the most powerful statistical me...
research
05/04/2022

The Effect of Multiple Imputation of Routine Pathology Variables on Laboratory Diagnosis of Hepatitis C Infection

Pathology tests are central to modern healthcare in terms of diagnosis a...
research
06/10/2019

The Impact of Regularization on High-dimensional Logistic Regression

Logistic regression is commonly used for modeling dichotomous outcomes. ...
research
05/11/2018

Stochastic Approximation EM for Logistic Regression with Missing Values

Logistic regression is a common classification method in supervised lear...
research
01/27/2021

To tune or not to tune, a case study of ridge logistic regression in small or sparse datasets

For finite samples with binary outcomes penalized logistic regression su...
research
11/25/2019

Modeling Variables with a Detection Limit using a Truncated Normal Distribution with Censoring

When data are collected subject to a detection limit, observations below...
research
02/01/2018

Linearized Binary Regression

Probit regression was first proposed by Bliss in 1934 to study mortality...

Please sign up or login with your details

Forgot password? Click here to reset