Prediction Error Reduction Function as a Variable Importance Score

01/25/2015
by   Ernest Fokoué, et al.
0

This paper introduces and develops a novel variable importance score function in the context of ensemble learning and demonstrates its appeal both theoretically and empirically. Our proposed score function is simple and more straightforward than its counterpart proposed in the context of random forest, and by avoiding permutations, it is by design computationally more efficient than the random forest variable importance function. Just like the random forest variable importance function, our score handles both regression and classification seamlessly. One of the distinct advantage of our proposed score is the fact that it offers a natural cut off at zero, with all the positive scores indicating importance and significance, while the negative scores are deemed indications of insignificance. An extra advantage of our proposed score lies in the fact it works very well beyond ensemble of trees and can seamlessly be used with any base learners in the random subspace learning context. Our examples, both simulated and real, demonstrate that our proposed score does compete mostly favorably with the random forest score.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/06/2012

rFerns: An Implementation of the Random Ferns Method for General-Purpose Machine Learning

In this paper I present an extended implementation of the Random ferns a...
research
09/29/2020

Selective Cascade of Residual ExtraTrees

We propose a novel tree-based ensemble method named Selective Cascade of...
research
08/05/2022

A Computational Exploration of Emerging Methods of Variable Importance Estimation

Estimating the importance of variables is an essential task in modern ma...
research
03/02/2023

A Notion of Feature Importance by Decorrelation and Detection of Trends by Random Forest Regression

In many studies, we want to determine the influence of certain features ...
research
09/07/2021

Optimizing model-agnostic Random Subspace ensembles

This paper presents a model-agnostic ensemble approach for supervised le...
research
04/08/2017

Interactive Graphics for Visually Diagnosing Forest Classifiers in R

This paper describes structuring data and constructing plots to explore ...
research
03/24/2021

Dimension Reduction Forests: Local Variable Importance using Structured Random Forests

Random forests are one of the most popular machine learning methods due ...

Please sign up or login with your details

Forgot password? Click here to reset