A model-free feature selection technique of feature screening and random forest based recursive feature elimination

02/15/2023
by Siwei Xia, et al.

In this paper, we propose a model-free feature selection method for ultra-high-dimensional data with a very large number of features. The method is a two-phase procedure that combines the fused Kolmogorov filter for feature screening with random forest based recursive feature elimination (RFE), which removes model restrictions and reduces computational complexity. It is fully nonparametric and works with various types of datasets. The method is accurate, model-free, and computationally efficient, and it applies to a wide range of practical problems, including multiclass classification, nonparametric regression, and Poisson regression. We show that the proposed method is selection consistent and L_2 consistent under weak regularity conditions. Simulations and real data examples further demonstrate that it outperforms existing methods.
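To make the screening phase concrete, the following is a minimal sketch of a Kolmogorov filter for a multiclass response, written with the Python standard library only. It is not the authors' implementation. The function names (`ks_distance`, `kolmogorov_filter`, `screen`), the toy data, and the cutoff `keep=5` are all assumptions made for illustration. Each feature is scored by the largest Kolmogorov-Smirnov distance between its class-conditional empirical distributions, and the top-scoring features are retained. The fused variant for a continuous response, which combines several slicings of the response, and the random forest based RFE phase are not shown.

```python
import bisect
import random
from itertools import combinations


def ks_distance(a, b):
    """Two-sample Kolmogorov-Smirnov distance: sup_x |F_a(x) - F_b(x)|."""
    a, b = sorted(a), sorted(b)
    # The largest gap between two empirical CDFs occurs at an observed value.
    return max(
        abs(bisect.bisect_right(a, x) / len(a) - bisect.bisect_right(b, x) / len(b))
        for x in a + b
    )


def kolmogorov_filter(X, y):
    """Score each column of X by the largest KS distance between any two classes."""
    classes = sorted(set(y))
    scores = []
    for j in range(len(X[0])):
        groups = [[row[j] for row, label in zip(X, y) if label == c] for c in classes]
        scores.append(max(ks_distance(g, h) for g, h in combinations(groups, 2)))
    return scores


def screen(X, y, keep):
    """Return the indices of the `keep` highest-scoring features, plus all scores."""
    scores = kolmogorov_filter(X, y)
    ranked = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
    return ranked[:keep], scores


# Hypothetical toy data: 3 classes, 50 features, only features 0 and 1 informative.
rng = random.Random(0)
y = [i % 3 for i in range(150)]
X = []
for label in y:
    row = [rng.gauss(0.0, 1.0) for _ in range(50)]
    row[0] = rng.gauss(3.0 * label, 1.0)   # class-dependent location shift
    row[1] = rng.gauss(-3.0 * label, 1.0)  # class-dependent location shift
    X.append(row)

selected, scores = screen(X, y, keep=5)
print("retained feature indices:", selected)
```

Because the score depends only on empirical distribution functions, it makes no assumption about how the response relates to the features, which is the sense in which such a filter is model-free. In a two-phase procedure, the retained indices would then be passed to a wrapper such as random forest based RFE for further pruning.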


research
04/04/2022

Deep Feature Screening: Feature Selection for Ultra High-Dimensional Data via Deep Neural Networks

The applications of traditional statistical feature selection methods to...
research
07/31/2017

Consistent Nonparametric Different-Feature Selection via the Sparsest k-Subgraph Problem

Two-sample feature selection is the problem of finding features that des...
research
06/23/2022

High-dimensional Variable Screening via Conditional Martingale Difference Divergence

Variable screening has been a useful research area that helps to deal wi...
research
01/18/2022

Nonparametric Feature Selection by Random Forests and Deep Neural Networks

Random forests are a widely used machine learning algorithm, but their c...
research
05/25/2023

Feature space reduction method for ultrahigh-dimensional, multiclass data: Random forest-based multiround screening (RFMS)

In recent years, numerous screening methods have been published for ultr...
research
08/06/2020

Heterogeneous Idealization of Ion Channel Recordings – Open Channel Noise

We propose a new model-free segmentation method for idealizing ion chann...
research
05/08/2022

On Exact Feature Screening in Ultrahigh-dimensional Binary Classification

We propose a new model-free feature screening method based on energy dis...
