SLM: End-to-end Feature Selection via Sparse Learnable Masks

04/06/2023
by Yihe Dong, et al.

Feature selection is widely used to reduce compute requirements during training, improve model interpretability, and improve model generalization. We propose SLM (Sparse Learnable Masks), a canonical approach for end-to-end feature selection that scales well with both the feature dimension and the number of samples. At the heart of SLM lies a simple but effective learnable sparse mask that learns which features to select. The mask gives rise to a novel objective that provably maximizes the mutual information (MI) between the selected features and the labels; we derive this objective from first principles as a quadratic relaxation of MI. In addition, we derive a scaling mechanism, based on a novel use of sparsemax, that allows SLM to precisely control the number of features selected. Ablation studies show that this mechanism makes learning more effective. Empirically, SLM achieves state-of-the-art results against a variety of competitive baselines on eight benchmark datasets, often by a significant margin, especially on datasets with real-world challenges such as class imbalance.
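
To make the role of sparsemax concrete, the following is a minimal pure-Python sketch of the standard sparsemax projection (Martins and Astudillo, 2016), together with a toy helper showing that scaling the logits changes how many entries of the mask are nonzero. The helper `mask_with_at_most_k`, its doubling schedule, and the example logits are assumptions made for illustration only; they are not the scaling mechanism derived in the paper, which the abstract does not spell out.

```python
def sparsemax(z):
    """Euclidean projection of z onto the probability simplex.

    Unlike softmax, the output can contain exact zeros.
    """
    z_sorted = sorted(z, reverse=True)
    cumsum = 0.0
    tau = 0.0
    for k, zk in enumerate(z_sorted, start=1):
        cumsum += zk
        if 1.0 + k * zk > cumsum:
            # The k largest entries are still in the support; update threshold.
            tau = (cumsum - 1.0) / k
        else:
            break
    return [max(zi - tau, 0.0) for zi in z]


def mask_with_at_most_k(logits, k, max_doublings=50):
    """Hypothetical helper: double a scale factor until sparsemax(scale * logits)
    has at most k nonzero entries (illustrative, not the paper's method)."""
    scale = 1.0
    mask = sparsemax(logits)
    for _ in range(max_doublings):
        if sum(m > 0 for m in mask) <= k:
            break
        scale *= 2.0
        mask = sparsemax([scale * v for v in logits])
    return mask, scale


# Toy logits standing in for learnable mask parameters (made-up values).
logits = [0.3, 0.25, 0.2, 0.15, 0.1]
mask, scale = mask_with_at_most_k(logits, k=2)
selected = [i for i, m in enumerate(mask) if m > 0]
print(selected, scale)
```

Larger scales yield sparser masks because sparsemax of a scaled vector amounts to projecting onto a smaller simplex, which raises the threshold and shrinks the support. In an end-to-end setting the logits would be trained jointly with the model rather than fixed as here.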

research
10/06/2012

Feature Selection via L1-Penalized Squared-Loss Mutual Information

Feature selection is a technique to screen out less important features. ...
research
12/02/2018

Feature Selection Based on Unique Relevant Information for Health Data

Feature selection, which searches for the most representative features i...
research
12/13/2020

Active Feature Selection for the Mutual Information Criterion

We study active feature selection, a novel feature selection setting in ...
research
01/21/2021

Orthogonal Least Squares Based Fast Feature Selection for Linear Classification

An Orthogonal Least Squares (OLS) based feature selection method is prop...
research
07/18/2022

High-Order Conditional Mutual Information Maximization for dealing with High-Order Dependencies in Feature Selection

This paper presents a novel feature selection method based on the condit...
research
06/05/2023

Estimating Conditional Mutual Information for Dynamic Feature Selection

Dynamic feature selection, where we sequentially query features to make ...
research
01/26/2017

A theoretical framework for evaluating forward feature selection methods based on mutual information

Feature selection problems arise in a variety of applications, such as m...
