A Transparent and Nonlinear Method for Variable Selection

07/01/2023
by   Keyao Wang, et al.
0

Variable selection is a procedure to attain the truly important predictors from inputs. Complex nonlinear dependencies and strong coupling pose great challenges for variable selection in high-dimensional data. In addition, real-world applications have increased demands for interpretability of the selection process. A pragmatic approach should not only attain the most predictive covariates, but also provide ample and easy-to-understand grounds for removing certain covariates. In view of these requirements, this paper puts forward an approach for transparent and nonlinear variable selection. In order to transparently decouple information within the input predictors, a three-step heuristic search is designed, via which the input predictors are grouped into four subsets: the relevant to be selected, and the uninformative, redundant, and conditionally independent to be removed. A nonlinear partial correlation coefficient is introduced to better identify the predictors which have nonlinear functional dependence with the response. The proposed method is model-free and the selected subset can be competent input for commonly used predictive models. Experiments demonstrate the superior performance of the proposed method against the state-of-the-art baselines in terms of prediction accuracy and model interpretability.

READ FULL TEXT

page 4

page 6

page 15

page 16

page 21

research
12/28/2021

Variable Selection Using Bayesian Additive Regression Trees

Variable selection is an important statistical problem. This problem bec...
research
03/24/2021

A Two-Stage Variable Selection Approach for Correlated High Dimensional Predictors

When fitting statistical models, some predictors are often found to be c...
research
06/18/2015

Optimal model-free prediction from multivariate time series

Forecasting a time series from multivariate predictors constitutes a cha...
research
01/20/2016

Nonlinear variable selection with continuous outcome: a nonparametric incremental forward stagewise approach

We present a method of variable selection for the situation where some p...
research
07/20/2020

Variable Selection in Macroeconomic Forecasting with Many Predictors

In the data-rich environment, using many economic predictors to forecast...
research
03/27/2022

Interpretable Machine Learning Models for Modal Split Prediction in Transportation Systems

Modal split prediction in transportation networks has the potential to s...
research
05/11/2020

Interpretable random forest models through forward variable selection

Random forest is a popular prediction approach for handling high dimensi...

Please sign up or login with your details

Forgot password? Click here to reset