Nonparametric Feature Selection by Random Forests and Deep Neural Networks

01/18/2022
by Xiaojun Mao, et al.

Random forests are a widely used machine learning algorithm, but their computational efficiency suffers on large-scale datasets with many instances and many useless features. We propose a nonparametric feature selection algorithm that combines random forests and deep neural networks, and we investigate its theoretical properties under regularity conditions. Using several synthetic models and a real-world example, we demonstrate the advantages of the proposed algorithm over alternatives in identifying useful features, avoiding useless ones, and computational efficiency. Although the algorithm is presented with standard random forests, it can be adapted to other machine learning algorithms, as long as the features can be sorted accordingly.


research
05/26/2020

The best way to select features?

Feature selection in machine learning is subject to the intrinsic random...
research
06/26/2019

A Debiased MDI Feature Importance Measure for Random Forests

Tree ensembles such as Random Forests have achieved impressive empirical...
research
02/15/2023

A model-free feature selection technique of feature screening and random forest based recursive feature elimination

In this paper, we propose a model-free feature selection method for ultr...
research
08/14/2020

Feature Selection Methods for Cost-Constrained Classification in Random Forests

Cost-sensitive feature selection describes a feature selection problem, ...
research
03/05/2022

Fuzzy Forests For Feature Selection in High-Dimensional Survey Data: An Application to the 2020 U.S. Presidential Election

An increasingly common methodological issue in the field of social scien...
research
06/24/2014

Reliable ABC model choice via random forests

Approximate Bayesian computation (ABC) methods provide an elaborate appr...
research
05/25/2021

SHAFF: Fast and consistent SHApley eFfect estimates via random Forests

Interpretability of learning algorithms is crucial for applications invo...
