Variable Importance Based Interaction Modeling with an Application on Initial Spread of COVID-19 in China

10/14/2022
by   Jianqiang Zhang, et al.
0

Interaction selection for linear regression models with both continuous and categorical predictors is useful in many fields of modern science, yet very challenging when the number of predictors is relatively large. Existing interaction selection methods focus on finding one optimal model. While attractive properties such as consistency and oracle property have been well established for such methods, they actually may perform poorly in terms of stability for high-dimensional data, and they do not typically deal with categorical predictors. In this paper, we introduce a variable importance based interaction modeling (VIBIM) procedure for learning interactions in a linear regression model with both continuous and categorical predictors. It delivers multiple strong candidate models with high stability and interpretability. Simulation studies demonstrate its good finite sample performance. We apply the VIBIM procedure to a Corona Virus Disease 2019 (COVID-19) data used in Tian et al. (2020) and measure the effects of relevant factors, including transmission control measures on the spread of COVID-19. We show that the VIBIM approach leads to better models in terms of interpretability, stability, reliability and prediction.

READ FULL TEXT

page 35

page 37

research
07/15/2020

A likelihood-based approach for multivariate categorical response regression in high dimensions

We propose a penalized likelihood method to fit the bivariate categorica...
research
03/24/2021

A Two-Stage Variable Selection Approach for Correlated High Dimensional Predictors

When fitting statistical models, some predictors are often found to be c...
research
10/13/2020

The Kendall Interaction Filter for Variable Interaction Screening in Ultra High Dimensional Classification Problems

Accounting for important interaction effects can improve prediction of m...
research
10/25/2021

Sufficient reductions in regression with mixed predictors

Most data sets comprise of measurements on continuous and categorical va...
research
01/05/2021

Weight-of-evidence 2.0 with shrinkage and spline-binning

In many practical applications, such as fraud detection, credit risk mod...
research
06/21/2023

Explaining human body responses in random vibration: Effect of motion direction, sitting posture, and anthropometry

This study investigates the effects of anthropometric attributes, biolog...
research
07/19/2019

Reluctant Interaction Modeling

Including pairwise interactions between the predictors of a regression m...

Please sign up or login with your details

Forgot password? Click here to reset