RENT – Repeated Elastic Net Technique for Feature Selection

09/27/2020
by   Anna Jenul, et al.
0

In this study we present the RENT feature selection method for binary classification and regression problems. We compare the performance of RENT to a number of other state-of-the-art feature selection methods on eight datasets (six for binary classification and two for regression) to illustrate RENT's performance with regard to prediction and reduction of total number of features. At its core RENT trains an ensemble of unique models using regularized elastic net to select features. Each model in the ensemble is trained with a unique and randomly selected subset from the full training data. From these models one can acquire weight distributions for each feature that contain rich information on the stability of feature selection and from which several adjustable classification criteria may be defined. Moreover, we acquire distributions of class predictions for each sample across many models in the ensemble. Analysis of these distributions may provide useful insight into which samples are more difficult to classify correctly than others. Overall, results from the tested datasets show that RENT not only can compete on-par with the best performing feature selection methods in this study, but also provides valuable insights into the stability of feature selection and sample classification.

READ FULL TEXT
research
02/05/2012

Improving feature selection algorithms using normalised feature histograms

The proposed feature selection method builds a histogram of the most sta...
research
05/01/2018

Adaptive group-regularized logistic elastic net regression

In high-dimensional data settings, additional information on the feature...
research
12/19/2017

Ensemble Models for Detecting Wikidata Vandalism with Stacking - Team Honeyberry Vandalism Detector at WSDM Cup 2017

The WSDM Cup 2017 is a binary classification task for classifying Wikida...
research
01/12/2020

On Feature Interactions Identified by Shapley Values of Binary Classification Games

For feature selection and related problems, we introduce the notion of c...
research
12/30/2020

Elastic Net based Feature Ranking and Selection

Feature selection is important in data representation and intelligent di...
research
10/10/2022

When to encourage using Gaussian regression for feature selection tasks with time-to-event outcome

IMPORTANCE: Feature selection with respect to time-to-event outcomes is ...
research
12/09/2015

Minimally Supervised Feature Selection for Classification (Master's Thesis, University Politehnica of Bucharest)

In the context of the highly increasing number of features that are avai...

Please sign up or login with your details

Forgot password? Click here to reset