Information Borrowing in Regression Models

01/09/2022
by   Amy Zhang, et al.
0

Model development often takes data structure, subject matter considerations, model assumptions, and goodness of fit into consideration. To diagnose issues with any of these factors, it can be helpful to understand regression model estimates at a more granular level. We propose a new method for decomposing point estimates from a regression model via weights placed on data clusters. The weights are informed only by the model specification and data availability and thus can be used to explicitly link the effects of data imbalance and model assumptions to actual model estimates. The weight matrix has been understood in linear models as the hat matrix in the existing literature. We extend it to Bayesian hierarchical regression models that incorporate prior information and complicated dependence structures through the covariance among random effects. We show that the model weights, which we call borrowing factors, generalize shrinkage and information borrowing to all regression models. In contrast, the focus of the hat matrix has been mainly on the diagonal elements indicating the amount of leverage. We also provide metrics that summarize the borrowing factors and are practically useful. We present the theoretical properties of the borrowing factors and associated metrics and demonstrate their usage in two examples. By explicitly quantifying borrowing and shrinkage, researchers can better incorporate domain knowledge and evaluate model performance and the impacts of data properties such as data imbalance or influential points.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/17/2022

Weighted Clustered Coefficients Regression Models in Survey Sampling

Regression models are studied in survey data and are widely used to cons...
research
10/06/2019

R-optimal designs for multi-response regression models with multi-factors

We investigate R-optimal designs for multi-response regression models wi...
research
06/28/2020

Improved Small Area Estimation via Compromise Regression Weights

Shrinkage estimates of small domain parameters typically utilize a combi...
research
09/11/2023

Liu-type Shrinkage Estimators for Mixture of Poisson Regressions with Experts: A Heart Disease Study

Count data play a critical role in medical research, such as heart disea...
research
11/29/2020

Approximate Cross-validated Mean Estimates for Bayesian Hierarchical Regression Models

We introduce a novel procedure for obtaining cross-validated predictive ...
research
11/02/2020

Gradient Boosting for Linear Mixed Models

Gradient boosting from the field of statistical learning is widely known...
research
11/13/2020

Formation of Regression Model for Analysis of Complex Systems Using Methodology of Genetic Algorithms

This study presents the approach to analyzing the evolution of an arbitr...

Please sign up or login with your details

Forgot password? Click here to reset