AFS: An Attention-based mechanism for Supervised Feature Selection

02/28/2019
by   Ning Gui, et al.
0

As an effective data preprocessing step, feature selection has shown its effectiveness to prepare high-dimensional data for many machine learning tasks. The proliferation of high di-mension and huge volume big data, however, has brought major challenges, e.g. computation complexity and stability on noisy data, upon existing feature-selection techniques. This paper introduces a novel neural network-based feature selection architecture, dubbed Attention-based Feature Selec-tion (AFS). AFS consists of two detachable modules: an at-tention module for feature weight generation and a learning module for the problem modeling. The attention module for-mulates correlation problem among features and supervision target into a binary classification problem, supported by a shallow attention net for each feature. Feature weights are generated based on the distribution of respective feature se-lection patterns adjusted by backpropagation during the train-ing process. The detachable structure allows existing off-the-shelf models to be directly reused, which allows for much less training time, demands for the training data and requirements for expertise. A hybrid initialization method is also intro-duced to boost the selection accuracy for datasets without enough samples for feature weight generation. Experimental results show that AFS achieves the best accuracy and stability in comparison to several state-of-art feature selection algo-rithms upon both MNIST, noisy MNIST and several datasets with small samples.

READ FULL TEXT
research
07/19/2022

A-SFS: Semi-supervised Feature Selection based on Multi-task Self-supervision

Feature selection is an important process in machine learning. It builds...
research
03/22/2022

On Supervised Feature Selection from High Dimensional Feature Spaces

The application of machine learning to image and video data often yields...
research
10/13/2016

An Information Theoretic Feature Selection Framework for Big Data under Apache Spark

With the advent of extremely high dimensional datasets, dimensionality r...
research
05/09/2021

Towards Dynamic Feature Selection with Attention to Assist Banking Customers in Establishing a New Business

Establishing a new business may involve Knowledge acquisition in various...
research
06/25/2020

Stochastic Subset Selection

Current machine learning algorithms are designed to work with huge volum...
research
02/11/2021

Feature Selection for Multivariate Time Series via Network Pruning

In recent years, there has been an ever increasing amount of multivariat...
research
07/21/2021

Differentiable Feature Selection, a Reparameterization Approach

We consider the task of feature selection for reconstruction which consi...

Please sign up or login with your details

Forgot password? Click here to reset