Firth's logistic regression with rare events: accurate effect estimates AND predictions?

01/19/2021
by Rainer Puhr, et al.

Firth-type logistic regression has become a standard approach for the analysis of binary outcomes with small samples. Although it reduces the bias in maximum likelihood estimates of coefficients, it introduces bias towards 1/2 in the predicted probabilities. The stronger the imbalance of the outcome, the more severe the bias in the predicted probabilities. We propose two simple modifications of Firth-type logistic regression resulting in unbiased predicted probabilities. The first corrects the predicted probabilities by a post-hoc adjustment of the intercept. The other is based on an alternative formulation of Firth-type estimation as an iterative data augmentation procedure. Our suggested modification consists of introducing an indicator variable that distinguishes between original and pseudo observations in the augmented data. In a comprehensive simulation study these approaches are compared to other attempts to improve predictions based on Firth-type penalization and to other published penalization strategies intended for routine use. For instance, we consider a recently suggested compromise between maximum likelihood and Firth-type logistic regression. Simulation results are scrutinized both with regard to prediction and regression coefficients. Finally, the methods considered are illustrated and compared for a study on arterial closure devices in minimally invasive cardiac surgery.
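
The bias towards 1/2 and the post-hoc intercept adjustment described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the function names `firth_logistic` and `flic_intercept` are hypothetical, the toy data (2 events in 100 observations with x = 0, 6 events in 100 with x = 1) are invented, and the fitting routine is a plain Fisher-scoring loop on Firth's modified score without the step-halving safeguards a production implementation would use. The intercept adjustment is sketched here as re-estimating the intercept by maximum likelihood while holding the Firth slope fixed as an offset.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def firth_logistic(X, y, max_iter=100, tol=1e-8):
    """Firth-type penalized logistic regression (illustrative sketch).

    X must already contain an intercept column. Iterates Fisher-scoring
    steps on the modified score X^T (y - pi + h * (1/2 - pi)), where h
    holds the diagonal elements of the hat matrix.
    """
    beta = np.zeros(X.shape[1])
    for _ in range(max_iter):
        pi = sigmoid(X @ beta)
        w = pi * (1.0 - pi)
        info_inv = np.linalg.inv(X.T @ (w[:, None] * X))
        # h_i = w_i * x_i^T (X^T W X)^{-1} x_i
        h = w * np.einsum("ij,jk,ik->i", X, info_inv, X)
        step = info_inv @ (X.T @ (y - pi + h * (0.5 - pi)))
        beta = beta + step
        if np.max(np.abs(step)) < tol:
            break
    return beta


def flic_intercept(offset, y, start=0.0, max_iter=100, tol=1e-10):
    """Maximum likelihood intercept with the remaining linear predictor
    held fixed as an offset (Newton iterations on a single parameter)."""
    b0 = start
    for _ in range(max_iter):
        pi = sigmoid(b0 + offset)
        step = np.sum(y - pi) / np.sum(pi * (1.0 - pi))
        b0 += step
        if abs(step) < tol:
            break
    return b0


# Invented toy data: a rare outcome and one binary covariate.
x = np.repeat([0.0, 1.0], 100)
y = np.zeros(200)
y[:2] = 1.0        # 2 events among the 100 observations with x = 0
y[100:106] = 1.0   # 6 events among the 100 observations with x = 1
X = np.column_stack([np.ones_like(x), x])

beta = firth_logistic(X, y)
p_firth = sigmoid(X @ beta)

# Keep the Firth slope, re-estimate only the intercept.
offset = X[:, 1:] @ beta[1:]
b0 = flic_intercept(offset, y, start=beta[0])
p_flic = sigmoid(b0 + offset)

print("observed event rate:        ", y.mean())
print("mean Firth prediction:      ", p_firth.mean())
print("mean prediction, new b0:    ", p_flic.mean())
```

In this saturated toy model the Firth predictions work out to (k + 1/2)/(n + 1) per group, so their average exceeds the observed event rate of 0.04. After the intercept is re-estimated, the average predicted probability matches the observed rate while the slope estimate is unchanged.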

research
01/19/2021

On resampling methods for model assessment in penalized and unpenalized logistic regression

Penalized logistic regression methods are frequently used to investigate...
research
04/22/2019

A Maximum Entropy Procedure to Solve Likelihood Equations

In this article we provide initial findings regarding the problem of sol...
research
02/17/2022

Conjugate priors and bias reduction for logistic regression models

Logistic regression models for binomial responses are routinely used in ...
research
01/27/2021

To tune or not to tune, a case study of ridge logistic regression in small or sparse datasets

For finite samples with binary outcomes penalized logistic regression su...
research
08/25/2023

Calibration plots for multistate risk predictions models: an overview and simulation comparing novel approaches

Introduction. There is currently no guidance on how to assess the calibr...
research
03/19/2018

A modern maximum-likelihood theory for high-dimensional logistic regression

Every student in statistics or data science learns early on that when th...
research
04/05/2023

Distributed Logistic Regression for Massive Data with Rare Events

Large-scale rare events data are commonly encountered in practice. To ta...
