James-Stein estimation of the first principal component

09/05/2021
by   Alex Shkolnik, et al.
0

The Stein paradox has played an influential role in the field of high dimensional statistics. This result warns that the sample mean, classically regarded as the "usual estimator", may be suboptimal in high dimensions. The development of the James-Stein estimator, that addresses this paradox, has by now inspired a large literature on the theme of "shrinkage" in statistics. In this direction, we develop a James-Stein type estimator for the first principal component of a high dimension and low sample size data set. This estimator shrinks the usual estimator, an eigenvector of a sample covariance matrix under a spiked covariance model, and yields superior asymptotic guarantees. Our derivation draws a close connection to the original James-Stein formula so that the motivation and recipe for shrinkage is intuited in a natural way.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/02/2016

Cross-validation based Nonlinear Shrinkage

Many machine learning algorithms require precise estimates of covariance...
research
03/22/2021

Optimal Linear Classification via Eigenvalue Shrinkage: The Case of Additive Noise

In this paper, we consider the general problem of testing the mean of tw...
research
09/14/2023

Spectrum-Aware Adjustment: A New Debiasing Framework with Applications to Principal Components Regression

We introduce a new debiasing framework for high-dimensional linear regre...
research
03/09/2022

High Dimensional Statistical Analysis and its Application to ALMA Map of NGC 253

In astronomy, if we denote the dimension of data as d and the number of ...
research
10/10/2022

Optimal Eigenvalue Shrinkage in the Semicircle Limit

Recent studies of high-dimensional covariance estimation often assume th...
research
08/17/2018

Inconsistency of diagonal scaling under high-dimensional limit: a replica approach

In this note, we claim that diagonal scaling of a sample covariance matr...
research
10/26/2022

R-NL: Fast and Robust Covariance Estimation for Elliptical Distributions in High Dimensions

We combine Tyler's robust estimator of the dispersion matrix with nonlin...

Please sign up or login with your details

Forgot password? Click here to reset