Variable Importance Assessments and Backward Variable Selection for High-Dimensional Data

06/18/2018
by   Liuhua Peng, et al.
0

Variable selection in high-dimensional scenarios is of great interested in statistics. One application involves identifying differentially expressed genes in genomic analysis. Existing methods for addressing this problem have some limits or disadvantages. In this paper, we propose distance based variable importance measures to deal with these problems, which is inspired by the Multi-Response Permutation Procedure (MRPP). The proposed variable importance assessments can effectively measure the importance of an individual dimension by quantifying its influence on the differences between multivariate distributions. A backward selection algorithm is developed that can be used in high-dimensional variable selection to discover important variables. Both simulations and real data applications demonstrate that our proposed method enjoys good properties and has advantages over other methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/07/2021

ENNS: Variable Selection, Regression, Classification and Deep Neural Network for High-Dimensional Data

High-dimensional, low sample-size (HDLSS) data problems have been a topi...
research
12/28/2021

Variable Selection Using Bayesian Additive Regression Trees

Variable selection is an important statistical problem. This problem bec...
research
11/12/2018

Global sensitivity analysis for optimization with variable selection

The optimization of high dimensional functions is a key issue in enginee...
research
11/29/2021

A Fast Non-parametric Approach for Causal Structure Learning in Polytrees

We study the problem of causal structure learning with no assumptions on...
research
09/27/2018

Auto-Encoding Knockoff Generator for FDR Controlled Variable Selection

A new statistical procedure (Model-X candes2018) has provided a way to i...
research
11/22/2017

Sparse Variable Selection on High Dimensional Heterogeneous Data with Tree Structured Responses

We consider the problem of sparse variable selection on high dimension h...
research
12/05/2019

Asymptotic Unbiasedness of the Permutation Importance Measure in Random Forest Models

Variable selection in sparse regression models is an important task as a...

Please sign up or login with your details

Forgot password? Click here to reset