Grouping effects of sparse CCA models in variable selection

08/07/2020
by   Kefei Liu, et al.
0

The sparse canonical correlation analysis (SCCA) is a bi-multivariate association model that finds sparse linear combinations of two sets of variables that are maximally correlated with each other. In addition to the standard SCCA model, a simplified SCCA criterion which maixmizes the cross-covariance between a pair of canonical variables instead of their cross-correlation, is widely used in the literature due to its computational simplicity. However, the behaviors/properties of the solutions of these two models remain unknown in theory. In this paper, we analyze the grouping effect of the standard and simplified SCCA models in variable selection. In high-dimensional settings, the variables often form groups with high within-group correlation and low between-group correlation. Our theoretical analysis shows that for grouped variable selection, the simplified SCCA jointly selects or deselects a group of variables together, while the standard SCCA randomly selects a few dominant variables from each relevant group of correlated variables. Empirical results on synthetic data and real imaging genetics data verify the finding of our theoretical analysis.

READ FULL TEXT

page 6

page 20

page 22

page 26

research
03/11/2016

Efficient Clustering of Correlated Variables and Variable Selection in High-Dimensional Linear Models

In this paper, we introduce Adaptive Cluster Lasso(ACL) method for varia...
research
10/29/2016

A general multiblock method for structured variable selection

Regularised canonical correlation analysis was recently extended to more...
research
01/02/2018

Variable selection in Functional Additive Regression Models

This paper considers the problem of variable selection when some of the ...
research
08/09/2012

High-Dimensional Screening Using Multiple Grouping of Variables

Screening is the problem of finding a superset of the set of non-zero en...
research
06/10/2020

Robust Grouped Variable Selection Using Distributionally Robust Optimization

We propose a Distributionally Robust Optimization (DRO) formulation with...
research
03/24/2014

Simultaneous sparse estimation of canonical vectors in the p>>N setting

This article considers the problem of sparse estimation of canonical vec...
research
08/13/2022

A sequential stepwise screening procedure for sparse recovery in high-dimensional multiresponse models with complex group structures

Multiresponse data with complex group structures in both responses and p...

Please sign up or login with your details

Forgot password? Click here to reset