Subspace Learning for Feature Selection via Rank Revealing QR Factorization: Unsupervised and Hybrid Approaches with Non-negative Matrix Factorization and Evolutionary Algorith

10/02/2022
by   Amir Moslemi, et al.
0

The selection of most informative and discriminative features from high-dimensional data has been noticed as an important topic in machine learning and data engineering. Using matrix factorization-based techniques such as nonnegative matrix factorization for feature selection has emerged as a hot topic in feature selection. The main goal of feature selection using matrix factorization is to extract a subspace which approximates the original space but in a lower dimension. In this study, rank revealing QR (RRQR) factorization, which is computationally cheaper than singular value decomposition (SVD), is leveraged in obtaining the most informative features as a novel unsupervised feature selection technique. This technique uses the permutation matrix of QR for feature selection which is a unique property to this factorization method. Moreover, QR factorization is embedded into non-negative matrix factorization (NMF) objective function as a new unsupervised feature selection method. Lastly, a hybrid feature selection algorithm is proposed by coupling RRQR, as a filter-based technique, and a Genetic algorithm as a wrapper-based technique. In this method, redundant features are removed using RRQR factorization and the most discriminative subset of features are selected using the Genetic algorithm. The proposed algorithm shows to be dependable and robust when compared against state-of-the-art feature selection algorithms in supervised, unsupervised, and semi-supervised settings. All methods are tested on seven available microarray datasets using KNN, SVM and C4.5 classifiers. In terms of evaluation metrics, the experimental results shows that the proposed method is comparable with the state-of-the-art feature selection.

READ FULL TEXT

page 18

page 22

page 23

page 24

page 25

page 26

research
03/24/2021

Feature Weighted Non-negative Matrix Factorization

Non-negative Matrix Factorization (NMF) is one of the most popular techn...
research
11/02/2022

Rank Selection for Non-negative Matrix Factorization

Non-Negative Matrix Factorization (NMF) is a widely used dimension reduc...
research
02/27/2020

High-Dimensional Feature Selection for Genomic Datasets

In the presence of large dimensional datasets that contain many irreleva...
research
03/03/2021

PIntMF: Penalized Integrative Matrix Factorization Method for Multi-Omics Data

It is more and more common to explore the genome at diverse levels and n...
research
11/12/2019

Semi-supervised Wrapper Feature Selection with Imperfect Labels

In this paper, we propose a new wrapper approach for semi-supervised fea...
research
02/02/2018

Generating Redundant Features with Unsupervised Multi-Tree Genetic Programming

Recently, feature selection has become an increasingly important area of...
research
06/04/2021

Analysis of the robustness of NMF algorithms

We examine three non-negative matrix factorization techniques; L2-norm, ...

Please sign up or login with your details

Forgot password? Click here to reset