Conditional Uncorrelation and Efficient Non-approximate Subset Selection in Sparse Regression

09/08/2020
by Jianji Wang, et al.

Given m d-dimensional responsors and n d-dimensional predictors, sparse regression finds at most k predictors for each responsor to approximate it linearly, where 1 ≤ k ≤ d-1. The key problem in sparse regression is subset selection, which usually suffers from high computational cost. Here we consider sparse regression from the viewpoint of correlation and propose the formula of conditional uncorrelation. We then propose an efficient, non-approximate method of subset selection that does not need to calculate any linear coefficients for the candidate predictors. With the proposed method, the computational complexity for each candidate subset in sparse regression is reduced from O((1/2)k^3 + kd) to O((1/3)k^3). Because the dimension d is generally the number of observations or experiments and is therefore large, the proposed method can significantly improve the efficiency of sparse regression.
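To make the idea of correlation-based subset selection concrete, here is a minimal Python sketch. It does not reproduce the paper's conditional uncorrelation formula, which the abstract does not state. Instead it assumes the textbook identity R^2 = c_S^T G_SS^{-1} c_S, where G holds the pairwise correlations among predictors and c the correlations between each predictor and the responsor. The function names `standardize` and `best_subset` are hypothetical and chosen only for illustration. The correlations are computed once, so each candidate subset costs only a k-by-k solve that is independent of d.

```python
import itertools

import numpy as np


def standardize(X):
    """Center each column and scale it to unit Euclidean norm."""
    X = X - X.mean(axis=0)
    return X / np.linalg.norm(X, axis=0)


def best_subset(X, y, k):
    """Exhaustive search over subsets of at most k predictors.

    X is a (d, n) array whose columns are the predictors; y is a
    (d,) responsor. Returns (R^2, subset) for the best subset found.
    Illustrative only: assumes no constant or exactly collinear
    columns, otherwise the small linear solve below fails.
    """
    Xs = standardize(X)
    ys = standardize(y[:, None])[:, 0]
    G = Xs.T @ Xs  # (n, n) correlations among predictors, computed once
    c = Xs.T @ ys  # (n,) correlations between predictors and responsor

    best_r2, best_S = -1.0, ()
    for size in range(1, k + 1):
        for S in itertools.combinations(range(X.shape[1]), size):
            idx = list(S)
            # Assumed identity: R^2 = c_S^T G_SS^{-1} c_S.
            # Only a size-by-size system is solved; d does not appear.
            r2 = c[idx] @ np.linalg.solve(G[np.ix_(idx, idx)], c[idx])
            if r2 > best_r2:
                best_r2, best_S = r2, S
    return best_r2, best_S
```

Calling `best_subset(X, y, 2)` on a responsor built from two of the predictor columns plus small noise should return those two column indices. The exhaustive loop keeps the sketch short; it is not meant to scale to large n.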

research
12/13/2019

Best Subset Selection in Reduced Rank Regression

Reduced rank regression is popularly used for modeling the relationship ...
research
09/12/2023

A Consistent and Scalable Algorithm for Best Subset Selection in Single Index Models

Analysis of high-dimensional data has led to increased interest in both ...
research
04/25/2021

Robust selection of predictors and conditional outlier detection in a perturbed large-dimensional regression context

This paper presents a fast methodology, called ROBOUT, to identify outli...
research
08/29/2023

A Novel Dual Predictors Framework of PEE

In this paper, we propose an improved 2D-PEH based on double prediction-e...
research
06/10/2019

Selection consistency of Lasso-based procedures for misspecified high-dimensional binary model and random regressors

We consider selection of random predictors for high-dimensional regressi...
research
03/30/2023

KOO approach for scalable variable selection problem in large-dimensional regression

An important issue in many multivariate regression problems is to elimin...
research
01/27/2017

Subset Selection for Multiple Linear Regression via Optimization

Subset selection in multiple linear regression is to choose a subset of ...