Zero-inflated generalized extreme value regression model for binary data and application in health study

05/02/2021
by Aba Diop, et al.

The logistic regression model is widely used to investigate the relationship between a binary response variable Y and a set of potential predictors 𝐗. The binary response may represent, for example, the occurrence of some outcome of interest (Y=1 if the outcome occurred and Y=0 otherwise). When the dependent variable Y represents a rare event, the logistic regression model shows relevant drawbacks. To overcome these drawbacks, we propose the Generalized Extreme Value (GEV) regression model. In particular, we suggest the quantile function of the GEV distribution as the link function, so that attention is focused on the tail of the response curve for values close to one. A sample of observations is said to contain a cure fraction when a proportion of the study subjects (the so-called cured individuals, as opposed to the susceptibles) cannot experience the outcome of interest. One problem that then arises is that it is usually unknown which subjects are cured and which are susceptible, unless the outcome of interest has been observed. In these settings, a logistic regression analysis of the relationship between 𝐗 and Y among the susceptibles is no longer straightforward. We develop a maximum likelihood estimation procedure for this problem, based on the joint modeling of the binary response of interest and the cure status. We investigate the identifiability of the resulting model. We then conduct a simulation study to investigate its finite-sample behavior and apply the model to real data.
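The abstract does not spell out the model's parameterization, so the following is only a minimal sketch under stated assumptions. It assumes the commonly used GEV response curve pi(x) = exp(-(1 + tau * x'beta)^(-1/tau)) with shape parameter tau != 0, and a logistic model for the probability of being susceptible. The names `gev_response`, `gev_link`, `neg_loglik`, the covariate matrix `Z`, and the coefficient vector `gamma` are hypothetical and are not taken from the paper.

```python
import numpy as np

def gev_response(eta, tau):
    """Assumed GEV response curve: exp(-(1 + tau*eta)_+^(-1/tau)), tau != 0."""
    eta = np.asarray(eta, dtype=float)
    # Clip to the support of the GEV distribution to avoid invalid powers.
    base = np.maximum(1.0 + tau * eta, 1e-12)
    return np.exp(-base ** (-1.0 / tau))

def gev_link(pi, tau):
    """GEV quantile function used as link; inverse of gev_response on its support."""
    pi = np.asarray(pi, dtype=float)
    return ((-np.log(pi)) ** (-tau) - 1.0) / tau

def neg_loglik(beta, gamma, tau, X, Z, y):
    """Negative log-likelihood of a zero-inflated binary model (assumed form).

    P(Y=1 | X, Z) = P(susceptible | Z) * P(Y=1 | X, susceptible),
    so an observed Y=0 may come from a cured or a susceptible subject.
    """
    pi = gev_response(X @ beta, tau)                 # event probability among susceptibles
    p_susc = 1.0 / (1.0 + np.exp(-(Z @ gamma)))      # assumed logistic susceptibility model
    p1 = np.clip(p_susc * pi, 1e-12, 1.0 - 1e-12)    # marginal P(Y=1)
    return -np.sum(y * np.log(p1) + (1.0 - y) * np.log1p(-p1))
```

A function such as `neg_loglik` could be passed to a generic numerical optimizer to obtain maximum likelihood estimates. Whether the parameters are identifiable depends on conditions that the paper investigates and that this sketch does not check.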


research
05/02/2021

Parametric bootstrapping in a generalized extreme value regression model for binary response

Generalized extreme value (GEV) regression is often more adapted when we...
research
01/03/2021

Binary Outcome Copula Regression Model with Sampling Gradient Fitting

Use copula to model dependency of variable extends multivariate gaussian...
research
11/17/2022

Mediation analysis with case-control sampling: Identification and estimation in the presence of a binary mediator

With reference to a stratified case-control procedure based on a binary ...
research
06/03/2019

Multiplicative Effect Modeling: The General Case

Generalized linear models, such as logistic regression, are widely used ...
research
01/13/2020

Generalized Linear Models for Longitudinal Data with Biased Sampling Designs: A Sequential Offsetted Regressions Approach

Biased sampling designs can be highly efficient when studying rare (bina...
research
10/28/2021

Robust model-based estimation for binary outcomes in genomics studies

In quantitative genetics, statistical modeling techniques are used to fa...
research
01/19/2021

On resampling methods for model assessment in penalized and unpenalized logistic regression

Penalized logistic regression methods are frequently used to investigate...
