Dealing with Logs and Zeros in Regression Models

03/22/2022
by Christophe Bellégo, et al.

Log-linear models are prevalent in empirical research. Yet how to handle zeros in the dependent variable remains an unsettled issue. This article clarifies the issue and addresses the log-of-zero problem by developing a new family of estimators called iterated Ordinary Least Squares (iOLS). This family nests standard approaches such as log-linear and Poisson regressions, offers several computational advantages, and corresponds to the correct way to perform the popular log(Y+1) transformation. We extend it to the endogenous regressor setting (i2SLS) and overcome other common issues with Poisson models, such as controlling for many fixed effects. We also develop specification tests to help researchers select between alternative estimators. Finally, we illustrate our methods through numerical simulations and replications of landmark publications.
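To make the log-of-zero problem concrete, the following minimal sketch compares the popular log(Y+1) shortcut with a Poisson pseudo-maximum-likelihood fit on simulated counts that contain zeros. It is not the authors' iOLS estimator, which the abstract does not specify. The data-generating process (Poisson counts with mean exp(x)), the function names, and the Newton solver are all assumptions chosen for illustration.

```python
import numpy as np


def simulate(n=5000, beta0=0.0, beta1=1.0, seed=0):
    """Hypothetical data: Poisson counts with E[Y|x] = exp(beta0 + beta1*x)."""
    rng = np.random.default_rng(seed)
    x = rng.normal(size=n)
    y = rng.poisson(np.exp(beta0 + beta1 * x))  # contains many zeros
    X = np.column_stack([np.ones(n), x])        # intercept + one regressor
    return X, y


def ols(X, z):
    """Ordinary least squares coefficients of z on X."""
    return np.linalg.lstsq(X, z, rcond=None)[0]


def poisson_pml(X, y, tol=1e-10, max_iter=100):
    """Poisson pseudo-maximum likelihood via plain Newton steps (assumed solver)."""
    beta = np.zeros(X.shape[1])
    beta[0] = np.log(y.mean())  # start at the constant-mean fit
    for _ in range(max_iter):
        mu = np.exp(X @ beta)
        # Newton step: (X' diag(mu) X)^{-1} X' (y - mu)
        step = np.linalg.solve(X.T @ (X * mu[:, None]), X.T @ (y - mu))
        beta = beta + step
        if np.max(np.abs(step)) < tol:
            break
    return beta


X, y = simulate()
b_log1p = ols(X, np.log(y + 1.0))  # the ad hoc log(Y+1) regression
b_pml = poisson_pml(X, y)          # keeps the zeros, no transformation

print("share of zeros:     ", np.mean(y == 0))
print("log(Y+1) OLS slope: ", b_log1p[1])
print("Poisson PML slope:  ", b_pml[1])
```

Under these assumptions the true slope is 1. The Poisson fit should recover it closely, while the log(Y+1) regression estimates a different quantity and should fall well short, which is the kind of discrepancy that motivates a principled treatment of zeros.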

research
06/10/2018

A Least Squares Estimation of a Hybrid log-Poisson Regression and its Goodness of Fit for Optimal Loss Reserves in Insurance

In this article, the parameters of a hybrid log-linear model (log-Poisso...
research
11/02/2017

Collapsibility of marginal models for categorical data

We consider marginal log-linear models for parameterizing distributions ...
research
10/26/2018

Estimating grouped data models with a binary dependent variable and fixed effect via logit vs OLS: the impact of dropped units

This letter deals with a very simple issue: if we have grouped data with...
research
01/30/2023

A Simulation Study of the Performance of Statistical Models for Count Outcomes with Excessive Zeros

Background: Outcome measures that are count variables with excessive zer...
research
04/03/2020

Composite mixture of log-linear models for categorical data

Multivariate categorical data are routinely collected in many applicatio...
research
04/26/2019

Poisson PCA: Poisson Measurement Error corrected PCA, with Application to Microbiome Data

In this paper, we study the problem of computing a Principal Component A...
research
02/08/2019

Fast Sequence Segmentation using Log-Linear Models

Sequence segmentation is a well-studied problem, where given a sequence ...
