Analyzing covariate clustering effects in healthcare cost subgroups: insights and applications for prediction

03/10/2023
by   Zhengxiao Li, et al.
0

Healthcare cost prediction is a challenging task due to the high-dimensionality and high correlation among covariates. Additionally, the skewed, heavy-tailed, and often multi-modal nature of cost data can complicate matters further due to unobserved heterogeneity. In this study, we propose a novel framework for finite mixture regression models that incorporates covariate clustering methods to better account for the effects of clustered covariates on subgroups of the outcome, which enables a more accurate characterization of the complex distribution of the data. The proposed framework can be formulated as a convex optimization problem with an additional penalty term based on the prior similarity of the covariates. To efficiently solve this optimization problem, a specialized EM-ADMM algorithm is proposed that integrates the alternating direction multiplicative method (ADMM) into the iterative process of the expectation-maximizing (EM) algorithm. The convergence of the algorithm and the efficiency of the covariate clustering method are verified using simulation data, and the superiority of the approach over traditional regression techniques is demonstrated using two real Chinese medical expenditure datasets. Our empirical results provide valuable insights into the complex network graph of the covariates and can inform business practices, such as the design and pricing of medical insurance products.

READ FULL TEXT
research
03/05/2019

Convex Covariate Clustering for Classification

Clustering, like covariate selection for classification, is an important...
research
05/26/2021

Flexible Bayesian modelling of concomitant covariate effects in mixture models

Mixture models provide a useful tool to account for unobserved heterogen...
research
10/13/2014

Convex Modeling of Interactions with Strong Heredity

We consider the task of fitting a regression model involving interaction...
research
03/20/2023

An ADMM approach for multi-response regression with overlapping groups and interaction effects

In this paper, we consider the regularized multi-response regression pro...
research
05/30/2019

Clustered Gaussian Graphical Model via Symmetric Convex Clustering

Knowledge of functional groupings of neurons can shed light on structure...
research
12/01/2022

Robust multi-outcome regression with correlated covariate blocks using fused LAD-lasso

Lasso is a popular and efficient approach to simultaneous estimation and...
research
11/24/2022

Convergence Analysis of Stochastic Kriging-Assisted Simulation with Random Covariates

We consider performing simulation experiments in the presence of covaria...

Please sign up or login with your details

Forgot password? Click here to reset