Subspace Perspective on Canonical Correlation Analysis: Dimension Reduction and Minimax Rates

05/12/2016
by   Zhuang Ma, et al.

Canonical correlation analysis (CCA) is a fundamental statistical tool for exploring the correlation structure between two sets of random variables. In this paper, motivated by the recent success of applying CCA to learn low-dimensional representations of high-dimensional objects, we propose to quantify the estimation loss of CCA by the excess prediction loss defined through a prediction-after-dimension-reduction framework. Such a framework suggests viewing CCA estimation as estimating the subspaces spanned by the canonical variates. Interestingly, the proposed error metrics derived from the excess prediction loss turn out to be closely related to the principal angles between the subspaces spanned by the population and sample canonical variates, respectively. We characterize the non-asymptotic minimax rates under the proposed metrics, especially the dependency of the minimax rates on key quantities including the dimensions, the condition number of the covariance matrices, the canonical correlations, and the eigen-gap, with minimal assumptions on the joint covariance matrix. To the best of our knowledge, this is the first finite-sample result that captures the effect of the canonical correlations on the minimax rates.
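As a rough illustration of the subspace view described above, the following sketch estimates the leading canonical directions from simulated data and measures the principal angles between the estimated and population subspaces. It is not the paper's procedure or its exact loss: the function names, the simulation design (identity marginal covariances, canonical correlations 0.9 and 0.7), and the use of plain Euclidean principal angles on the coefficient vectors are all assumptions made for this example. With identity covariance for the first variable set, Euclidean angles between coefficient subspaces coincide with angles between the spans of the canonical variates; in general they need not.

```python
import numpy as np


def inv_sqrt(S):
    """Inverse symmetric square root of a positive definite matrix."""
    w, V = np.linalg.eigh(S)
    return (V / np.sqrt(w)) @ V.T


def cca_directions(Sxx, Sxy, Syy, k):
    """Top-k canonical directions for X and the canonical correlations.

    Uses the SVD of the whitened cross-covariance Sxx^{-1/2} Sxy Syy^{-1/2}.
    """
    Sxx_ih = inv_sqrt(Sxx)
    Syy_ih = inv_sqrt(Syy)
    U, s, _ = np.linalg.svd(Sxx_ih @ Sxy @ Syy_ih)
    return Sxx_ih @ U[:, :k], s[:k]


def principal_angles(A, B):
    """Euclidean principal angles (radians) between span(A) and span(B)."""
    Qa, _ = np.linalg.qr(A)
    Qb, _ = np.linalg.qr(B)
    s = np.linalg.svd(Qa.T @ Qb, compute_uv=False)
    return np.arccos(np.clip(s, -1.0, 1.0))


# Hypothetical simulation design (an assumption for illustration only).
rng = np.random.default_rng(0)
p, q, k, n = 5, 4, 2, 5000
rho = np.array([0.9, 0.7])            # population canonical correlations
L = np.zeros((p, q))
L[np.arange(k), np.arange(k)] = rho   # cross-covariance between X and Y
Sigma = np.block([[np.eye(p), L], [L.T, np.eye(q)]])

# Draw n joint samples and form the sample covariance blocks.
Z = rng.multivariate_normal(np.zeros(p + q), Sigma, size=n)
S = np.cov(Z, rowvar=False)
Sxx, Sxy, Syy = S[:p, :p], S[:p, p:], S[p:, p:]

Phi_hat, rho_hat = cca_directions(Sxx, Sxy, Syy, k)
Phi = np.eye(p)[:, :k]                # population directions in this design
angles = principal_angles(Phi, Phi_hat)

print("sample canonical correlations:", rho_hat)
print("principal angles (radians):", angles)
```

Shrinking `n`, or moving the second canonical correlation closer to zero so that the eigen-gap narrows, should make the angles grow, which is the qualitative dependence the minimax rates are meant to capture.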


research
02/22/2020

Sample canonical correlation coefficients of high-dimensional random vectors: local law and Tracy-Widom limit

Consider two random vectors C_1^1/2x∈R^p and C_2^1/2y∈R^q, where the ent...
research
05/08/2018

Optimal Subspace Estimation Using Overidentifying Vectors via Generalized Method of Moments

Many statistical models seek relationship between variables via subspace...
research
09/26/2019

Estimating covariance and precision matrices along subspaces

We study the accuracy of estimating the covariance and the precision mat...
research
11/02/2012

Minimax sparse principal subspace estimation in high dimensions

We study sparse principal components analysis in high dimensions, where ...
research
07/10/2020

Principal Loading Analysis

This paper proposes a tool for dimension reduction where the dimension o...
research
02/28/2019

Distance-Based Independence Screening for Canonical Analysis

This paper introduces a new method named Distance-based Independence Scr...
research
10/16/2020

Minimax Quasi-Bayesian estimation in sparse canonical correlation analysis via a Rayleigh quotient function

Canonical correlation analysis (CCA) is a popular statistical technique ...
