Feature Selection via the Intervened Interpolative Decomposition and its Application in Diversifying Quantitative Strategies

09/29/2022
by   Jun Lu, et al.
0

In this paper, we propose a probabilistic model for computing an interpolative decomposition (ID) in which each column of the observed matrix has its own priority or importance, so that the end result of the decomposition finds a set of features that are representative of the entire set of features, and the selected features also have higher priority than others. This approach is commonly used for low-rank approximation, feature selection, and extracting hidden patterns in data, where the matrix factors are latent variables associated with each data dimension. Gibbs sampling for Bayesian inference is applied to carry out the optimization. We evaluate the proposed models on real-world datasets, including ten Chinese A-share stocks, and demonstrate that the proposed Bayesian ID algorithm with intervention (IID) produces comparable reconstructive errors to existing Bayesian ID algorithms while selecting features with higher scores or priority.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/30/2022

Bayesian Low-Rank Interpolative Decomposition for Complex Datasets

In this paper, we introduce a probabilistic model for learning interpola...
research
06/29/2022

Comparative Study of Inference Methods for Interpolative Decomposition

In this paper, we propose a probabilistic model with automatic relevance...
research
08/22/2022

Robust Bayesian Nonnegative Matrix Factorization with Implicit Regularizers

We introduce a probabilistic model with implicit norm regularization for...
research
10/02/2022

Ensembling improves stability and power of feature selection for deep learning models

With the growing adoption of deep learning models in different real-worl...
research
07/13/2017

Comparative Study of Inference Methods for Bayesian Nonnegative Matrix Factorisation

In this paper, we study the trade-offs of different inference approaches...
research
08/19/2016

Unsupervised Feature Selection Based on the Morisita Estimator of Intrinsic Dimension

This paper deals with a new filter algorithm for selecting the smallest ...
research
04/05/2023

Selecting Features by their Resilience to the Curse of Dimensionality

Real-world datasets are often of high dimension and effected by the curs...

Please sign up or login with your details

Forgot password? Click here to reset