A Simultaneous Transformation and Rounding Approach for Modeling Integer-Valued Data

06/27/2019
by   Daniel R. Kowal, et al.
1

We propose a simple yet powerful framework for modeling integer-valued data. The integer-valued data are modeled by Simultaneously Transforming And Rounding (STAR) a continuous-valued process, where the transformation may be known or learned from the data. Implicitly, STAR formalizes the commonly-applied yet incoherent procedure of (i) transforming integer-valued data and subsequently (ii) modeling the transformed data using Gaussian models. Importantly, STAR is well-defined for integer-valued data, which is reflected in predictive accuracy, and is designed to account for zero-inflation, bounded or censored data, and over- or underdispersion. Efficient computation is available via an MCMC algorithm, which provides a mechanism for direct adaptation of successful Bayesian methods for continuous data to the integer-valued data setting. Using the STAR framework, we develop new linear regression models, additive models, and Bayesian Additive Regression Trees (BART) for integer-valued data, which demonstrate substantial improvements in performance relative to existing regression models for a variety of simulated and real datasets.

READ FULL TEXT

page 38

page 41

research
06/27/2019

Simultaneous Transformation and Rounding (STAR) Models for Integer-Valued Data

We propose a simple yet powerful framework for modeling integer-valued d...
research
06/16/2021

Semiparametric count data regression for self-reported mental health

"For how many days during the past 30 days was your mental health not go...
research
02/04/2020

A Regression Tsetlin Machine with Integer Weighted Clauses for Compact Pattern Representation

The Regression Tsetlin Machine (RTM) addresses the lack of interpretabil...
research
01/09/2022

Tree-based Regression for Interval-valued Data

Regression methods for interval-valued data have been increasingly studi...
research
02/23/2020

Bayesian analysis of count-valued, binary-valued, and continuous-valued responses using unknown transformations

Consider the situation where an analyst has a Bayesian statistical model...
research
10/11/2022

Transforming RDF-star to Property Graphs: A Preliminary Analysis of Transformation Approaches – extended version

RDF and property graph models have many similarities, such as using basi...
research
11/03/2020

Graph Enhanced High Dimensional Kernel Regression

In this paper, the flexibility, versatility and predictive power of kern...

Please sign up or login with your details

Forgot password? Click here to reset