Simultaneous Transformation and Rounding (STAR) Models for Integer-Valued Data

06/27/2019
by   Daniel R. Kowal, et al.
0

We propose a simple yet powerful framework for modeling integer-valued data, such as counts, scores, and rounded data. The data-generating process is defined by Simultaneously Transforming and Rounding (STAR) a continuous-valued process, which produces a flexible family of integer-valued distributions capable of modeling zero-inflation, bounded or censored data, and over- or underdispersion. The transformation is modeled as unknown for greater distributional flexibility, while the rounding operation ensures a coherent integer-valued data-generating process. An efficient MCMC algorithm is developed for posterior inference and provides a mechanism for adaptation of successful Bayesian models and algorithms for continuous data to the integer-valued data setting. Using the STAR framework, we design a new Bayesian Additive Regression Tree (BART) model for integer-valued data, which demonstrates impressive predictive distribution accuracy for both synthetic data and a large healthcare utilization dataset. For interpretable regression-based inference, we develop a STAR additive model, which offers greater flexibility and scalability than existing integer-valued models. The STAR additive model is applied to study the recent decline in Amazon river dolphins.

READ FULL TEXT
research
06/27/2019

A Simultaneous Transformation and Rounding Approach for Modeling Integer-Valued Data

We propose a simple yet powerful framework for modeling integer-valued d...
research
02/23/2020

Bayesian analysis of count-valued, binary-valued, and continuous-valued responses using unknown transformations

Consider the situation where an analyst has a Bayesian statistical model...
research
06/16/2021

Semiparametric count data regression for self-reported mental health

"For how many days during the past 30 days was your mental health not go...
research
02/04/2022

First-order integer-valued autoregressive processes with Generalized Katz innovations

A new integer-valued autoregressive process (INAR) with Generalised Lagr...
research
08/31/2023

Locally Adaptive Shrinkage Priors for Trends and Breaks in Count Time Series

Non-stationary count time series characterized by features such as abrup...
research
07/08/2021

Inference and forecasting for continuous-time integer-valued trawl processes and their use in financial economics

This paper develops likelihood-based methods for estimation, inference, ...
research
10/12/2022

FasterRisk: Fast and Accurate Interpretable Risk Scores

Over the last century, risk scores have been the most popular form of pr...

Please sign up or login with your details

Forgot password? Click here to reset