Unsupervised Feature Selection via Multi-step Markov Transition Probability

05/29/2020
by   Yan Min, et al.
0

Feature selection is a widely used dimension reduction technique to select feature subsets because of its interpretability. Many methods have been proposed and achieved good results, in which the relationships between adjacent data points are mainly concerned. But the possible associations between data pairs that are may not adjacent are always neglected. Different from previous methods, we propose a novel and very simple approach for unsupervised feature selection, named MMFS (Multi-step Markov transition probability for Feature Selection). The idea is using multi-step Markov transition probability to describe the relation between any data pair. Two ways from the positive and negative viewpoints are employed respectively to keep the data structure after feature selection. From the positive viewpoint, the maximum transition probability that can be reached in a certain number of steps is used to describe the relation between two points. Then, the features which can keep the compact data structure are selected. From the viewpoint of negative, the minimum transition probability that can be reached in a certain number of steps is used to describe the relation between two points. On the contrary, the features that least maintain the loose data structure are selected. And the two ways can also be combined. Thus three algorithms are proposed. Our main contributions are a novel feature section approach which uses multi-step transition probability to characterize the data structure, and three algorithms proposed from the positive and negative aspects for keeping data structure. The performance of our approach is compared with the state-of-the-art methods on eight real-world data sets, and the experimental results show that the proposed MMFS is effective in unsupervised feature selection.

READ FULL TEXT
research
08/25/2018

Unsupervised Hypergraph Feature Selection via a Novel Point-Weighting Framework and Low-Rank Representation

Feature selection methods are widely used in order to solve the 'curse o...
research
02/25/2019

Epileptic seizure classification using statistical sampling and a novel feature selection algorithm

Epilepsy is a well-known neuronal disorder that can be identified by int...
research
11/10/2022

A metaheuristic multi-objective interaction-aware feature selection method

Multi-objective feature selection is one of the most significant issues ...
research
12/10/2019

Unsupervised Feature Selection based on Adaptive Similarity Learning and Subspace Clustering

Feature selection methods have an important role on the readability of d...
research
01/31/2022

Compactness Score: A Fast Filter Method for Unsupervised Feature Selection

For feature engineering, feature selection seems to be an important rese...
research
06/12/2018

MISSION: Ultra Large-Scale Feature Selection using Count-Sketches

Feature selection is an important challenge in machine learning. It play...
research
11/12/2021

Bayesian Knockoff Generators for Robust Inference Under Complex Data Structure

The recent proliferation of medical data, such as genetics and electroni...

Please sign up or login with your details

Forgot password? Click here to reset