Semiparametric count data regression for self-reported mental health

06/16/2021
by   Daniel R. Kowal, et al.
0

"For how many days during the past 30 days was your mental health not good?" The responses to this question measure self-reported mental health and can be linked to important covariates in the National Health and Nutrition Examination Survey (NHANES). However, these count variables present major distributional challenges: the data are overdispersed, zero-inflated, bounded by 30, and heaped in five- and seven-day increments. To meet these challenges, we design a semiparametric estimation and inference framework for count data regression. The data-generating process is defined by simultaneously transforming and rounding (STAR) a latent Gaussian regression model. The transformation is estimated nonparametrically and the rounding operator ensures the correct support for the discrete and bounded data. Maximum likelihood estimators are computed using an EM algorithm that is compatible with any continuous data model estimable by least squares. STAR regression includes asymptotic hypothesis testing and confidence intervals, variable selection via information criteria, and customized diagnostics. Simulation studies validate the utility of this framework. STAR is deployed to study the factors associated with self-reported mental health and demonstrates substantial improvements in goodness-of-fit compared to existing count data regression models.

READ FULL TEXT
research
11/08/2018

A New Count Regression Model including Gauss Hypergeometric Function with an application to model demand of health services

In this paper, an alternative count distribution suitable for modeling o...
research
06/27/2019

Simultaneous Transformation and Rounding (STAR) Models for Integer-Valued Data

We propose a simple yet powerful framework for modeling integer-valued d...
research
06/27/2019

A Simultaneous Transformation and Rounding Approach for Modeling Integer-Valued Data

We propose a simple yet powerful framework for modeling integer-valued d...
research
06/20/2018

RSDD-Time: Temporal Annotation of Self-Reported Mental Health Diagnoses

Self-reported diagnosis statements have been widely employed in studying...
research
04/23/2021

Understanding who uses Reddit: Profiling individuals with a self-reported bipolar disorder diagnosis

Recently, research on mental health conditions using public online data,...
research
10/25/2020

Latent Network Estimation and Variable Selection for Compositional Data via Variational EM

Network estimation and variable selection have been extensively studied ...
research
10/02/2019

Indicators of retention in remote digital health studies: A cross-study evaluation of 100,000 participants

Digital technologies such as smartphones are transforming the way scient...

Please sign up or login with your details

Forgot password? Click here to reset