Significance testing for canonical correlation analysis in high dimensions

10/17/2020
by   Ian W. McKeague, et al.
0

We consider the problem of testing for the presence of linear relationships between large sets of random variables based on a post-selection inference approach to canonical correlation analysis. The challenge is to adjust for the selection of subsets of variables having linear combinations with maximal sample correlation. To this end, we construct a stabilized one-step estimator of the euclidean-norm of the canonical correlations maximized over subsets of variables of pre-specified cardinality. This estimator is shown to be consistent for its target parameter and asymptotically normal provided the dimensions of the variables do not grow too quickly with sample size. We also develop a greedy search algorithm to accurately compute the estimator, leading to a computationally tractable omnibus test for the global null hypothesis that there are no linear relationships between any subsets of variables having the pre-specified cardinality. Further, we develop a confidence interval for the target parameter that takes the variable selection into account.

READ FULL TEXT

page 14

page 20

page 21

research
06/28/2023

High-Dimensional Canonical Correlation Analysis

This paper studies high-dimensional canonical correlation analysis (CCA)...
research
11/07/2017

A Tutorial on Canonical Correlation Methods

Canonical correlation analysis is a family of multivariate statistical m...
research
11/02/2022

Inferring independent sets of Gaussian variables after thresholding correlations

We consider testing whether a set of Gaussian variables, selected from t...
research
11/23/2020

Conditional canonical correlation estimation based on covariates with random forests

Investigating the relationships between two sets of variables helps to u...
research
10/19/2019

Robustifying multiple-set linear canonical analysis with S-estimator

We consider a robust version of multiple-set linear canonical analysis o...
research
11/03/2020

Canonical Correlation Analysis in high dimensions with structured regularization

Canonical Correlation Analysis (CCA) is a technique for measuring the as...
research
03/10/2018

Testing One Hypothesis Multiple Times: The Multidimensional Case

The identification of new rare signals in data, the detection of a sudde...

Please sign up or login with your details

Forgot password? Click here to reset