Statistical inference for principal components of spiked covariance matrices

08/27/2020
by   Zhigang Bao, et al.
0

In this paper, we study the asymptotic behavior of the extreme eigenvalues and eigenvectors of the high dimensional spiked sample covariance matrices, in the supercritical case when a reliable detection of spikes is possible. Especially, we derive the joint distribution of the extreme eigenvalues and the generalized components of the associated eigenvectors, i.e., the projections of the eigenvectors onto arbitrary given direction, assuming that the dimension and sample size are comparably large. In general, the joint distribution is given in terms of linear combinations of finitely many Gaussian and Chi-square variables, with parameters depending on the projection direction and the spikes. Our assumption on the spikes is fully general. First, the strengths of spikes are only required to be slightly above the critical threshold and no upper bound on the strengths is needed. Second, multiple spikes, i.e., spikes with the same strength, are allowed. Third, no structural assumption is imposed on the spikes. Thanks to the general setting, we can then apply the results to various high dimensional statistical hypothesis testing problems involving both the eigenvalues and eigenvectors. Specifically, we propose accurate and powerful statistics to conduct hypothesis testing on the principal components. These statistics are data-dependent and adaptive to the underlying true spikes. Numerical simulations also confirm the accuracy and powerfulness of our proposed statistics and illustrate significantly better performance compared to the existing methods in the literature. Especially, our methods are accurate and powerful even when either the spikes are small or the dimension is large.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/29/2019

Principal components of spiked covariance matrices in the supercritical regime

In this paper, we study the asymptotic behavior of the extreme eigenvalu...
research
08/21/2020

Large Deviations of Extreme Eigenvalues of Generalized Sample Covariance Matrices

Very rare events in which the largest eigenvalue of a random matrix is a...
research
08/10/2020

Tracy-Widom distribution for the edge eigenvalues of Gram type random matrices

Large dimensional Gram type matrices are common objects in high-dimensio...
research
06/23/2019

Asymptotic joint distribution of extreme eigenvalues and trace of large sample covariance matrix in a generalized spiked population model

This paper studies the joint limiting behavior of extreme eigenvalues an...
research
03/06/2023

Extreme eigenvalues of sample covariance matrices under generalized elliptical models with applications

We consider the extreme eigenvalues of the sample covariance matrix Q=YY...
research
03/22/2019

Principal components in linear mixed models with general bulk

We study the outlier eigenvalues and eigenvectors in variance components...

Please sign up or login with your details

Forgot password? Click here to reset