Feature Selection via Regularized Trees

01/07/2012
by   Houtao Deng, et al.
0

We propose a tree regularization framework, which enables many tree models to perform feature selection efficiently. The key idea of the regularization framework is to penalize selecting a new feature for splitting when its gain (e.g. information gain) is similar to the features used in previous splits. The regularization framework is applied on random forest and boosted trees here, and can be easily applied to other tree models. Experimental studies show that the regularized trees can select high-quality feature subsets with regard to both strong and weak classifiers. Because tree models can naturally deal with categorical and numerical variables, missing values, different scales between variables, interactions and nonlinearities etc., the tree regularization framework provides an effective and efficient feature selection solution for many practical problems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/12/2020

Generalizing Gain Penalization for Feature Selection in Tree-based Models

We develop a new approach for feature selection via gain penalization in...
research
11/08/2022

Individualized and Global Feature Attributions for Gradient Boosted Trees in the Presence of ℓ_2 Regularization

While ℓ_2 regularization is widely used in training gradient boosted tre...
research
09/04/2017

Random Subspace with Trees for Feature Selection Under Memory Constraints

Dealing with datasets of very high dimension is a major challenge in mac...
research
06/30/2021

Efficient Detection of Botnet Traffic by features selection and Decision Trees

Botnets are one of the online threats with the biggest presence, causing...
research
06/15/2018

Crime Event Embedding with Unsupervised Feature Selection

We present a novel event embedding algorithm for crime data that can joi...
research
04/26/2020

Classification Trees for Imbalanced and Sparse Data: Surface-to-Volume Regularization

Classification algorithms face difficulties when one or more classes hav...
research
04/14/2021

Regularized regression on compositional trees with application to MRI analysis

A compositional tree refers to a tree structure on a set of random varia...

Please sign up or login with your details

Forgot password? Click here to reset