Adaptive Graph-based Generalized Regression Model for Unsupervised Feature Selection

12/27/2020
by   Yanyong Huang, et al.
27

Unsupervised feature selection is an important method to reduce dimensions of high dimensional data without labels, which is benefit to avoid “curse of dimensionality” and improve the performance of subsequent machine learning tasks, like clustering and retrieval. How to select the uncorrelated and discriminative features is the key problem of unsupervised feature selection. Many proposed methods select features with strong discriminant and high redundancy, or vice versa. However, they only satisfy one of these two criteria. Other existing methods choose the discriminative features with low redundancy by constructing the graph matrix on the original feature space. Since the original feature space usually contains redundancy and noise, it will degrade the performance of feature selection. In order to address these issues, we first present a novel generalized regression model imposed by an uncorrelated constraint and the ℓ_2,1-norm regularization. It can simultaneously select the uncorrelated and discriminative features as well as reduce the variance of these data points belonging to the same neighborhood, which is help for the clustering task. Furthermore, the local intrinsic structure of data is constructed on the reduced dimensional space by learning the similarity-induced graph adaptively. Then the learnings of the graph structure and the indicator matrix based on the spectral analysis are integrated into the generalized regression model. Finally, we develop an alternative iterative optimization algorithm to solve the objective function. A series of experiments are carried out on nine real-world data sets to demonstrate the effectiveness of the proposed method in comparison with other competing approaches.

READ FULL TEXT

page 18

page 19

research
12/29/2020

Sparse PCA via l_2,p-Norm Regularization for Unsupervised Feature Selection

In the field of data mining, how to deal with high-dimensional data is a...
research
10/09/2020

Nonnegative Spectral Analysis with Adaptive Graph and L_2,0-Norm Regularization for Unsupervised Feature Selection

Feature selection is an important data preprocessing in data mining and ...
research
01/03/2019

Adaptive Locality Preserving Regression

This paper proposes a novel discriminative regression method, called ada...
research
09/08/2018

Identifying The Most Informative Features Using A Structurally Interacting Elastic Net

Feature selection can efficiently identify the most informative features...
research
01/15/2016

Improved graph-based SFA: Information preservation complements the slowness principle

Slow feature analysis (SFA) is an unsupervised-learning algorithm that e...
research
10/09/2019

Supervised feature selection with orthogonal regression and feature weighting

Effective features can improve the performance of a model, which can thu...
research
02/26/2019

Fused Lasso for Feature Selection using Structural Information

Feature selection has been proven a powerful preprocessing step for high...

Please sign up or login with your details

Forgot password? Click here to reset