Stochastic Approximation EM for Logistic Regression with Missing Values

05/11/2018
by Wei Jiang, et al.

Logistic regression is a common classification method in supervised learning. Surprisingly, there are very few solutions for fitting it and selecting variables in the presence of missing values. We propose a stochastic approximation version of the EM algorithm, based on Metropolis-Hastings sampling, to perform statistical inference for logistic regression with incomplete data. The approach is complete: it covers the estimation of parameters and their variance, the derivation of confidence intervals, a model selection procedure, and a method for prediction on test sets with missing values. The method is computationally efficient, and its good coverage and variable selection properties are demonstrated in a simulation study. We then illustrate the method on a dataset of polytraumatized patients from Paris hospitals to predict the occurrence of hemorrhagic shock, a leading cause of early preventable death in severe trauma cases. The aim is to consolidate the current red flag procedure, a binary alert identifying patients with a high risk of severe hemorrhage. The methodology is implemented in the R package misaem.
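To make the Metropolis-Hastings step concrete, the following is a minimal sketch of how a missing covariate could be sampled given the observed covariate and the response. It is an illustration under stated assumptions, not the misaem implementation: it assumes two jointly Gaussian covariates with the second one missing, uses the conditional Gaussian distribution of the missing covariate as an independence proposal (so the acceptance ratio reduces to a ratio of logistic likelihoods), and all function and parameter names (`log_lik`, `mh_sample_missing`, `beta`, `mu`, `sigma`) are hypothetical.

```python
import math
import random


def softplus(t):
    # Numerically stable log(1 + exp(t)).
    return max(t, 0.0) + math.log1p(math.exp(-abs(t)))


def log_lik(y, x, beta):
    # Log-likelihood of one binary response y (0 or 1) under logistic
    # regression; beta[0] is the intercept (assumed parameterization).
    eta = beta[0] + sum(b * v for b, v in zip(beta[1:], x))
    return -softplus(-eta) if y == 1 else -softplus(eta)


def mh_sample_missing(y, x1, beta, mu, sigma, n_iter, rng):
    # Draw the missing covariate x2 from p(x2 | x1, y) by Metropolis-Hastings.
    # Assumption: (x1, x2) is bivariate Gaussian with mean mu and 2x2
    # covariance sigma, so x2 | x1 is Gaussian with the moments below.
    cond_mean = mu[1] + sigma[0][1] / sigma[0][0] * (x1 - mu[0])
    cond_sd = math.sqrt(sigma[1][1] - sigma[0][1] ** 2 / sigma[0][0])

    x2 = rng.gauss(cond_mean, cond_sd)  # initial state
    draws = []
    for _ in range(n_iter):
        # Independence proposal from p(x2 | x1): the proposal density cancels
        # against the covariate model in the target, leaving the ratio of
        # logistic likelihoods as the acceptance ratio.
        proposal = rng.gauss(cond_mean, cond_sd)
        log_ratio = log_lik(y, (x1, proposal), beta) - log_lik(y, (x1, x2), beta)
        if rng.random() < math.exp(min(0.0, log_ratio)):
            x2 = proposal
        draws.append(x2)
    return draws


if __name__ == "__main__":
    rng = random.Random(0)
    beta = (0.0, 0.0, 2.0)            # illustrative coefficients
    mu = (0.0, 0.0)
    sigma = ((1.0, 0.0), (0.0, 1.0))
    draws = mh_sample_missing(1, 0.0, beta, mu, sigma, 5000, rng)
    print(sum(draws) / len(draws))    # mean of the sampled missing values
```

In a stochastic approximation EM loop, draws of this kind would stand in for the intractable expectation over the missing values at each iteration; the parameter update and the step-size schedule are left out of this sketch.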


research
02/21/2022

Statistical Inference for Genetic Relatedness Based on High-Dimensional Logistic Regression

This paper studies the problem of statistical inference for genetic rela...
research
09/14/2019

Adaptive Bayesian SLOPE – High-dimensional Model Selection with Missing Values

The selection of variables with high-dimensional and missing data is a m...
research
02/07/2023

Logistic regression with missing responses and predictors: a review of existing approaches and a case study

In this work logistic regression when both the response and the predicto...
research
06/08/2023

Comprehensive Stepwise Selection for Logistic Regression

Automated variable selection is widely applied in statistical model deve...
research
05/22/2018

Regression Analysis of Proportion Outcomes with Random Effects

A regression method for proportional, or fractional, data with mixed eff...
research
05/29/2022

A Conditional Randomization Test for Sparse Logistic Regression in High-Dimension

Identifying the relevant variables for a classification model with corre...
research
06/26/2015

Convolutional networks and learning invariant to homogeneous multiplicative scalings

The conventional classification schemes -- notably multinomial logistic ...
