Rates of Bootstrap Approximation for Eigenvalues in High-Dimensional PCA

04/15/2021
by   Junwen Yao, et al.
0

In the context of principal components analysis (PCA), the bootstrap is commonly applied to solve a variety of inference problems, such as constructing confidence intervals for the eigenvalues of the population covariance matrix Σ. However, when the data are high-dimensional, there are relatively few theoretical guarantees that quantify the performance of the bootstrap. Our aim in this paper is to analyze how well the bootstrap can approximate the joint distribution of the leading eigenvalues of the sample covariance matrix Σ̂, and we establish non-asymptotic rates of approximation with respect to the multivariate Kolmogorov metric. Under certain assumptions, we show that the bootstrap can achieve the dimension-free rate of r(Σ)/√(n) up to logarithmic factors, where r(Σ) is the effective rank of Σ, and n is the sample size. From a methodological standpoint, our work also illustrates that applying a transformation to the eigenvalues of Σ̂ before bootstrapping is an important consideration in high-dimensional settings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/13/2022

Testing the number of common factors by bootstrap in high-dimensional factor models

This paper proposes to test the number of common factors in high-dimensi...
research
04/29/2020

A generalized information criterion for high-dimensional PCA rank selection

Principal component analysis (PCA) is the most commonly used statistical...
research
05/10/2022

Confidence Intervals for the Number of Components in Factor Analysis and Principal Components Analysis via Subsampling

Factor analysis (FA) and principal component analysis (PCA) are popular ...
research
02/01/2022

Bootstrap Confidence Regions for Learned Feature Embeddings

Algorithmic feature learners provide high-dimensional vector representat...
research
09/13/2019

Bootstrapping the Operator Norm in High Dimensions: Error Estimation for Covariance Matrices and Sketching

Although the operator (spectral) norm is one of the most widely used met...
research
06/28/2021

Bootstrapping the error of Oja's Algorithm

We consider the problem of quantifying uncertainty for the estimation er...
research
08/05/2022

Improved Rates of Bootstrap Approximation for the Operator Norm: A Coordinate-Free Approach

Let Σ̂=1/n∑_i=1^n X_i⊗ X_i denote the sample covariance operator of cent...

Please sign up or login with your details

Forgot password? Click here to reset