Efficient and Effective L_0 Feature Selection

08/07/2018
by   Ana Kenney, et al.
0

Because of continuous advances in mathematical programing, Mix Integer Optimization has become a competitive vis-a-vis popular regularization method for selecting features in regression problems. The approach exhibits unquestionable foundational appeal and versatility, but also poses important challenges. We tackle these challenges, reducing computational burden when tuning the sparsity bound (a parameter which is critical for effectiveness) and improving performance in the presence of feature collinearity and of signals that vary in nature and strength. Importantly, we render the approach efficient and effective in applications of realistic size and complexity - without resorting to relaxations or heuristics in the optimization, or abandoning rigorous cross-validation tuning. Computational viability and improved performance in subtler scenarios is achieved with a multi-pronged blueprint, leveraging characteristics of the Mixed Integer Programming framework and by means of whitening, a data pre-processing step.

READ FULL TEXT
research
02/20/2023

Model-based feature selection for neural networks: A mixed-integer programming approach

In this work, we develop a novel input feature selection framework for R...
research
05/28/2022

Feature subset selection for kernel SVM classification via mixed-integer optimization

We study the mixed-integer optimization (MIO) approach to feature subset...
research
08/04/2020

No Cross-Validation Required: An Analytical Framework for Regularized Mixed-Integer Problems (Extended Version)

This paper develops a method to obtain the optimal value for the regular...
research
09/12/2022

Bilevel Optimization for Feature Selection in the Data-Driven Newsvendor Problem

We study the feature-based newsvendor problem, in which a decision-maker...
research
08/24/2023

A Strength and Sparsity Preserving Algorithm for Generating Weighted, Directed Networks with Predetermined Assortativity

Degree-preserving rewiring is a widely used technique for generating unw...
research
12/12/2019

Sequential vs. Integrated Algorithm Selection and Configuration: A Case Study for the Modular CMA-ES

When faced with a specific optimization problem, choosing which algorith...
research
11/21/2021

A stochastic extended Rippa's algorithm for LpOCV

In kernel-based approximation, the tuning of the so-called shape paramet...

Please sign up or login with your details

Forgot password? Click here to reset