Fast Parallel Algorithms for Feature Selection

03/06/2019
by Sharon Qian, et al.

In this paper, we analyze a fast parallel algorithm to efficiently select and build a set of k random variables from a large set of n candidate elements. This combinatorial optimization problem can be viewed in the context of feature selection for the prediction of a response variable. Using the adaptive sampling technique, which has recently been shown to exponentially speed up submodular maximization algorithms, we propose a new parallelizable algorithm that dramatically speeds up previous selection algorithms by reducing the number of rounds from O(k) to O(log n) for objectives that do not conform to the submodularity property. We introduce a new metric to quantify the closeness of the objective function to submodularity and analyze the performance of adaptive sampling under this regime. We also conduct experiments on synthetic and real datasets and show that the empirical performance of adaptive sampling on non-submodular objectives greatly exceeds its theoretical lower bound. Additionally, the empirical running time improves drastically in all experiments without compromising the terminal value, showing the practicality of adaptive sampling.
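To make the O(k) round count concrete, the following is a minimal sketch of a sequential selection baseline. It is not the paper's algorithm. It assumes the baseline is standard greedy forward selection and that the objective is the R² of a least-squares fit; the function names `r_squared` and `greedy_select` and the synthetic data are hypothetical and chosen for illustration only. Each of the k rounds depends on the result of the previous one, which is the sequential bottleneck that adaptive sampling is meant to reduce.

```python
import numpy as np

def r_squared(X, y, subset):
    """R^2 of a least-squares fit of y on the chosen columns (assumes centered X and y)."""
    if not subset:
        return 0.0
    Xs = X[:, sorted(subset)]
    coef, *_ = np.linalg.lstsq(Xs, y, rcond=None)
    residual = y - Xs @ coef
    return 1.0 - float(residual @ residual) / float(y @ y)

def greedy_select(X, y, k):
    """Sequential greedy baseline: k rounds, each adding the feature with the largest marginal gain."""
    selected = []
    for _ in range(k):
        base = r_squared(X, y, selected)
        candidates = [j for j in range(X.shape[1]) if j not in selected]
        # The gains within one round are independent and could be evaluated in parallel,
        # but the rounds themselves must run one after another.
        gains = [r_squared(X, y, selected + [j]) - base for j in candidates]
        selected.append(candidates[int(np.argmax(gains))])
    return selected

# Hypothetical synthetic data: the response depends on features 2, 7 and 11 only.
rng = np.random.default_rng(0)
n_samples, n_features, k = 200, 30, 3
X = rng.standard_normal((n_samples, n_features))
y = X[:, [2, 7, 11]] @ np.array([3.0, -2.0, 1.5]) + 0.1 * rng.standard_normal(n_samples)
X = X - X.mean(axis=0)
y = y - y.mean()

chosen = greedy_select(X, y, k)
print(sorted(chosen), round(r_squared(X, y, chosen), 4))
```

In this sketch the number of sequential rounds equals k. An adaptive sampling approach would instead add a block of features per round, so that the number of rounds grows with log n rather than with k.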


research
04/17/2018

An Exponential Speedup in Parallel Running Time for Submodular Maximization without Loss in Approximation

In this paper we study the adaptivity of submodular maximization. Adapti...
research
02/19/2011

Submodular meets Spectral: Greedy Algorithms for Subset Selection, Sparse Approximation and Dictionary Selection

We study the problem of selecting a subset of k random variables from a ...
research
03/08/2017

Scalable Greedy Feature Selection via Weak Submodularity

Greedy algorithms are widely used for problems in machine learning such ...
research
07/30/2018

Non-monotone Submodular Maximization in Exponentially Fewer Iterations

In this paper we consider parallelization for applications whose objecti...
research
02/28/2022

Fast Feature Selection with Fairness Constraints

We study the fundamental problem of selecting optimal features for model...
research
06/30/2021

The Power of Adaptivity for Stochastic Submodular Cover

In the stochastic submodular cover problem, the goal is to select a subs...
