Feature Gradients: Scalable Feature Selection via Discrete Relaxation

08/27/2019
by Rishit Sheth, et al.

In this paper we introduce Feature Gradients, a gradient-based search algorithm for feature selection. Our approach extends a recent result on the estimation of learnability in the sublinear data regime by showing that the calculation can be performed iteratively (i.e., in mini-batches) and in linear time and space with respect to both the number of features D and the sample size N. This, along with a discrete-to-continuous relaxation of the search domain, allows for an efficient, gradient-based search among feature subsets for very large datasets. Crucially, our algorithm is capable of finding higher-order correlations between features and targets in both the N > D and N < D regimes, as opposed to approaches that do not consider such interactions and/or only consider one regime. We demonstrate the algorithm experimentally in small and large sample- and feature-size settings.
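To make the idea of a discrete-to-continuous relaxation concrete, the following is a minimal sketch and not the paper's algorithm. It assumes a simple linear least-squares surrogate in place of the learnability estimator the abstract refers to, so it only captures first-order (linear) relationships. The binary choice "feature j in or out" is relaxed to a mask value sigmoid(theta_j) in (0, 1), and the mask is optimized by mini-batch gradient descent. The function name `relaxed_feature_search`, the penalty weights, and the synthetic data are all hypothetical choices made for illustration.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def relaxed_feature_search(X, y, lam_mask=0.05, lam_w=0.01, lr=0.05,
                           epochs=300, batch_size=50, seed=0):
    """Illustrative relaxed feature search (linear surrogate, an assumption).

    Minimizes, by mini-batch gradient descent over theta and w:
        mean((X @ (m * w) - y)**2) + lam_mask * sum(m) + lam_w * sum(w**2)
    where m = sigmoid(theta) is the continuous stand-in for a 0/1 mask.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    theta = np.zeros(d)  # mask logits; sigmoid(0) = 0.5 for every feature
    w = np.zeros(d)      # weights of the linear surrogate model
    for _ in range(epochs):
        order = rng.permutation(n)
        for start in range(0, n, batch_size):
            idx = order[start:start + batch_size]
            Xb, yb = X[idx], y[idx]
            m = sigmoid(theta)
            resid = (Xb * m) @ w - yb
            # Gradient of the batch MSE with respect to the product m * w.
            g = 2.0 * (Xb.T @ resid) / len(idx)
            grad_w = g * m + 2.0 * lam_w * w
            # Chain rule through the sigmoid: dm/dtheta = m * (1 - m).
            grad_theta = (g * w + lam_mask) * m * (1.0 - m)
            w -= lr * grad_w
            theta -= lr * grad_theta
    return sigmoid(theta)


# Synthetic data (hypothetical): only features 0 and 3 influence the target.
data_rng = np.random.default_rng(0)
X = data_rng.standard_normal((400, 10))
y = 2.0 * X[:, 0] - 3.0 * X[:, 3] + 0.1 * data_rng.standard_normal(400)

mask = relaxed_feature_search(X, y)
selected = np.flatnonzero(mask > 0.5)  # round the relaxed mask back to a subset
print("relaxed mask:", np.round(mask, 2))
print("selected features:", selected)
```

Each update touches one mini-batch and costs time proportional to the batch size times D, which loosely mirrors the linear scaling the abstract describes. Detecting higher-order interactions would require replacing the linear surrogate with an objective that is sensitive to them.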


