Fast Feature Selection with Fairness Constraints

02/28/2022
by Francesco Quinzan, et al.

We study the fundamental problem of selecting optimal features for model construction. This problem is computationally challenging on large datasets, even with greedy algorithm variants. To address this challenge, we extend the adaptive query model, recently proposed for greedy forward selection with submodular functions, to the faster paradigm of Orthogonal Matching Pursuit for non-submodular functions. Our extension also allows the use of downward-closed constraints, which can encode certain fairness criteria into the feature selection process, so the proposed algorithm handles these fairness constraints by design. The algorithm achieves exponentially fast parallel run time in the adaptive query model, scaling much better than prior work. We prove strong approximation guarantees for the algorithm based on standard assumptions. These guarantees are applicable to many parametric models, including Generalized Linear Models. Finally, we demonstrate empirically that the proposed algorithm competes favorably with state-of-the-art techniques for feature selection on real-world and synthetic datasets.
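To make the two ingredients named in the abstract concrete, here is a minimal sketch of sequential Orthogonal Matching Pursuit for least squares under a downward-closed constraint. It is not the paper's parallel algorithm: it picks one feature per round, and the constraint shown (at most `k` features overall and at most `per_group_cap` per group) is only one assumed example of a downward-closed family, meaning every subset of a feasible set is also feasible. The names `is_feasible`, `solve`, `omp_select`, `groups`, and `per_group_cap` are hypothetical and chosen for illustration.

```python
def is_feasible(selected, groups, per_group_cap, k):
    """Assumed downward-closed constraint: at most k features in total
    and at most per_group_cap features from any single group."""
    if len(selected) > k:
        return False
    counts = {}
    for j in selected:
        counts[groups[j]] = counts.get(groups[j], 0) + 1
        if counts[groups[j]] > per_group_cap:
            return False
    return True


def solve(A, b):
    """Solve the square system A x = b by Gaussian elimination with pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = M[i][n] - sum(M[i][c] * x[c] for c in range(i + 1, n))
        x[i] = s / M[i][i]
    return x


def omp_select(X, y, groups, per_group_cap, k):
    """Sequential OMP sketch: X is a list of sample rows, y the targets."""
    n, d = len(X), len(X[0])
    selected, residual = [], list(y)
    for _ in range(k):
        best, best_score = None, 0.0
        for j in range(d):
            if j in selected:
                continue
            if not is_feasible(selected + [j], groups, per_group_cap, k):
                continue
            # OMP score: |correlation of feature j with the current residual|
            score = abs(sum(X[i][j] * residual[i] for i in range(n)))
            if score > best_score:
                best, best_score = j, score
        if best is None:  # no feasible feature improves the fit
            break
        selected.append(best)
        # Refit least squares on the selected features (normal equations).
        G = [[sum(X[i][a] * X[i][b] for i in range(n)) for b in selected]
             for a in selected]
        c = [sum(X[i][a] * y[i] for i in range(n)) for a in selected]
        coef = solve(G, c)
        residual = [y[i] - sum(w * X[i][j] for w, j in zip(coef, selected))
                    for i in range(n)]
    return selected
```

On a toy input where features 0 and 1 share a group, capping each group at one feature forces the second pick to come from the other group, even though feature 1 correlates more strongly with the residual.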

research
03/08/2017

Scalable Greedy Feature Selection via Weak Submodularity

Greedy algorithms are widely used for problems in machine learning such ...
research
11/20/2017

Deletion-Robust Submodular Maximization at Scale

Can we efficiently extract useful information from a large user-generate...
research
09/29/2022

Sequential Attention for Feature Selection

Feature selection is the problem of selecting a subset of features for a...
research
02/19/2011

Submodular meets Spectral: Greedy Algorithms for Subset Selection, Sparse Approximation and Dictionary Selection

We study the problem of selecting a subset of k random variables from a ...
research
07/19/2022

Neural Greedy Pursuit for Feature Selection

We propose a greedy algorithm to select N important features among P inp...
research
03/06/2019

Fast Parallel Algorithms for Feature Selection

In this paper, we analyze a fast parallel algorithm to efficiently selec...
research
09/21/2016

Saturating Splines and Feature Selection

We extend the adaptive regression spline model by incorporating saturati...
