Log In Sign Up

E2E-FS: An End-to-End Feature Selection Method for Neural Networks

by   Brais Cancela, et al.

Classic embedded feature selection algorithms are often divided in two large groups: tree-based algorithms and lasso variants. Both approaches are focused in different aspects: while the tree-based algorithms provide a clear explanation about which variables are being used to trigger a certain output, lasso-like approaches sacrifice a detailed explanation in favor of increasing its accuracy. In this paper, we present a novel embedded feature selection algorithm, called End-to-End Feature Selection (E2E-FS), that aims to provide both accuracy and explainability in a clever way. Despite having non-convex regularization terms, our algorithm, similar to the lasso approach, is solved with gradient descent techniques, introducing some restrictions that force the model to specifically select a maximum number of features that are going to be used subsequently by the classifier. Although these are hard restrictions, the experimental results obtained show that this algorithm can be used with any learning model that is trained using a gradient descent algorithm.


page 1

page 2

page 3

page 4


A scalable saliency-based Feature selection method with instance level information

Classic feature selection techniques remove those features that are eith...

LassoLayer: Nonlinear Feature Selection by Switching One-to-one Links

Along with the desire to address more complex problems, feature selectio...

On the Adversarial Robustness of LASSO Based Feature Selection

In this paper, we investigate the adversarial robustness of feature sele...

Multi-stage Convex Relaxation for Feature Selection

A number of recent work studied the effectiveness of feature selection u...

Sparse Neural Additive Model: Interpretable Deep Learning with Feature Selection via Group Sparsity

Interpretable machine learning has demonstrated impressive performance w...

ET-Lasso: Efficient Tuning of Lasso for High-Dimensional Data

The L1 regularization (Lasso) has proven to be a versatile tool to selec...

Identification of feasible pathway information for c-di-GMP binding proteins in cellulose production

In this paper, we utilize a machine learning approach to identify the si...