Fitting a Hurdle Generalized Lambda Distribution to healthcare expenses

12/06/2017
by Diego Marcondes, et al.

To fit a model to healthcare expenses data, it is necessary to take into account some of its peculiarities, such as the excess of zeros and the skewness, which call for flexible models instead of the usual ones from the exponential family. In this context, the Generalized Lambda Distribution (GLD) is quite useful: it is highly flexible, since its parameters may be chosen so that it has a given mean, variance, skewness and kurtosis. Furthermore, the GLD approximates other distributions very well, so it may be employed as a wild-card distribution in many applications. Taking advantage of the flexibility of the GLD, we develop a hurdle, or two-way, model whose associated distribution is the GLD, and apply it to healthcare expenses data. We first present a thorough review of the literature on the GLD and then develop hurdle GLD marginal and regression models. Finally, we apply the developed models to a dataset of yearly healthcare expenses, modelling them as a function of the covariates sex, age and previous-year expenses. The fitted models are compared with the kernel density estimate and with models based on the Generalized Pareto Distribution (GPD). We find that the GLD models perform better than the GPD ones in modelling healthcare expenses.
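The idea of a hurdle model built on the GLD can be illustrated with a short sketch. The abstract does not say which GLD parameterization the paper uses, so the code below assumes the FKML form, in which the distribution is defined through its quantile function. The function names `gld_quantile` and `hurdle_quantile` and all parameter values are hypothetical and chosen only for illustration; they are not taken from the paper. The hurdle part places a point mass `p0` at zero and uses the GLD for the positive expenses.

```python
import math
import random


def gld_quantile(u, lam1, lam2, lam3, lam4):
    """Quantile function of the GLD in the FKML parameterization (assumed here).

    lam1 is a location parameter, lam2 > 0 a scale parameter, and
    lam3, lam4 control the left and right tails respectively.
    """
    if not 0.0 < u < 1.0:
        raise ValueError("u must lie strictly between 0 and 1")
    if lam2 <= 0.0:
        raise ValueError("lam2 must be positive")
    # Each tail term uses its logarithmic limit when the shape parameter is 0.
    left = math.log(u) if lam3 == 0.0 else (u ** lam3 - 1.0) / lam3
    right = math.log(1.0 - u) if lam4 == 0.0 else ((1.0 - u) ** lam4 - 1.0) / lam4
    return lam1 + (left - right) / lam2


def hurdle_quantile(u, p0, lams):
    """Quantile function of a hurdle model: mass p0 at zero, GLD otherwise."""
    if u <= p0:
        return 0.0
    # Rescale the remaining probability mass onto (0, 1) for the GLD part.
    return gld_quantile((u - p0) / (1.0 - p0), *lams)


# Illustrative values only: with lam3 > 0 the lower end of the support is
# lam1 - 1 / (lam2 * lam3), which is 0 here, and lam4 < 0 gives a heavy
# right tail, mimicking skewed positive expenses.
lams = (1.0, 1.0, 1.0, -0.2)
p0 = 0.3

rng = random.Random(0)
sample = [hurdle_quantile(rng.random(), p0, lams) for _ in range(20000)]
zero_share = sum(1 for y in sample if y == 0.0) / len(sample)
```

Sampling by inversion, as above, is convenient because the GLD has a closed-form quantile function but no closed-form density. In this sketch `zero_share` should be close to `p0`, and every nonzero draw is positive because the illustrative parameters place the lower end of the GLD support at zero.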
