Tuning in ridge logistic regression to solve separation

11/30/2020
by   Hana Šinkovec, et al.
0

Separation in logistic regression is a common problem causing failure of the iterative estimation process when finding maximum likelihood estimates. Firth's correction (FC) was proposed as a solution, providing estimates also in presence of separation. In this paper we evaluate whether ridge regression (RR) could be considered instead, specifically, if it could reduce the mean squared error (MSE) of coefficient estimates in comparison to FC. In RR the tuning parameter determining the penalty strength is usually obtained by minimizing some measure of the out-of-sample prediction error or information criterion. However, in presence of separation tuning these measures can yield an optimized value of zero (no shrinkage), and hence cannot provide a universal solution. We derive a new bootstrap based tuning criterion B that always leads to shrinkage. Moreover, we demonstrate how valid inference can be obtained by combining resampled profile penalized likelihood functions. Our approach is illustrated in an example from oncology and its performance is compared to FC in a simulation study. Our simulations showed that in analyses of small and sparse datasets and with many correlated covariates B-tuned RR can yield coefficient estimates with MSE smaller than FC and confidence intervals that approximately achieve nominal coverage probabilities.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/27/2021

To tune or not to tune, a case study of ridge logistic regression in small or sparse datasets

For finite samples with binary outcomes penalized logistic regression su...
research
12/13/2018

Split regression modeling

In this note we study the benefits of splitting variables variables for ...
research
12/05/2018

Jeffreys' prior, finiteness and shrinkage in binomial-response generalized linear models

This paper studies the finiteness properties of a reduced-bias estimator...
research
09/28/2021

Penalized Likelihood Methods for Modeling of Reading Count Data

The paper considers parameter estimation in count data models using pena...
research
03/09/2021

The Efficient Shrinkage Path: Maximum Likelihood of Minimum MSE Risk

A new generalized ridge regression shrinkage path is proposed that is as...
research
04/22/2019

A Maximum Entropy Procedure to Solve Likelihood Equations

In this article we provide initial findings regarding the problem of sol...
research
10/31/2020

Smoothly Adaptively Centered Ridge Estimator

With a focus on linear models with smooth functional covariates, we prop...

Please sign up or login with your details

Forgot password? Click here to reset