Feature Selection via Mutual Information: New Theoretical Insights

07/17/2019
by   Mario Beraha, et al.
0

Mutual information has been successfully adopted in filter feature-selection methods to assess both the relevancy of a subset of features in predicting the target variable and the redundancy with respect to other variables. However, existing algorithms are mostly heuristic and do not offer any guarantee on the proposed solution. In this paper, we provide novel theoretical results showing that conditional mutual information naturally arises when bounding the ideal regression/classification errors achieved by different subsets of features. Leveraging on these insights, we propose a novel stopping condition for backward and forward greedy methods which ensures that the ideal prediction error using the selected feature subset remains bounded by a user-specified threshold. We provide numerical simulations to support our theoretical claims and compare to common heuristic methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/29/2018

Simple stopping criteria for information theoretic feature selection

Information theoretic feature selection aims to select a smallest featur...
research
11/24/2014

Mutual Information-Based Unsupervised Feature Transformation for Heterogeneous Feature Subset Selection

Conventional mutual information (MI) based feature selection (FS) method...
research
10/21/2022

An Adaptive Neighborhood Partition Full Conditional Mutual Information Maximization Method for Feature Selection

Feature selection is used to eliminate redundant features and keep relev...
research
10/06/2012

Feature Selection via L1-Penalized Squared-Loss Mutual Information

Feature selection is a technique to screen out less important features. ...
research
06/17/2023

Fair Causal Feature Selection

Causal feature selection has recently received increasing attention in m...
research
01/26/2017

A theoretical framework for evaluating forward feature selection methods based on mutual information

Feature selection problems arise in a variety of applications, such as m...
research
09/14/2023

Causal Entropy and Information Gain for Measuring Causal Control

Artificial intelligence models and methods commonly lack causal interpre...

Please sign up or login with your details

Forgot password? Click here to reset