Estimating Number of Factors by Adjusted Eigenvalues Thresholding

09/24/2019
by   Jianqing Fan, et al.
0

Determining the number of common factors is an important and practical topic in high dimensional factor models. The existing literatures are mainly based on the eigenvalues of the covariance matrix. Due to the incomparability of the eigenvalues of the covariance matrix caused by heterogeneous scales of observed variables, it is very difficult to give an accurate relationship between these eigenvalues and the number of common factors. To overcome this limitation, we appeal to the correlation matrix and show surprisingly that the number of eigenvalues greater than 1 of population correlation matrix is the same as the number of common factors under some mild conditions. To utilize such a relationship, we study the random matrix theory based on the sample correlation matrix in order to correct the biases in estimating the top eigenvalues and to take into account of estimation errors in eigenvalue estimation. This leads us to propose adjusted correlation thresholding (ACT) for determining the number of common factors in high dimensional factor models, taking into account the sampling variabilities and biases of top sample eigenvalues. We also establish the optimality of the proposed methods in terms of minimal signal strength and optimal threshold. Simulation studies lend further support to our proposed method and show that our estimator outperforms other competing methods in most of our testing cases.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/13/2022

Testing the number of common factors by bootstrap in high-dimensional factor models

This paper proposes to test the number of common factors in high-dimensi...
research
11/02/2018

RSVP-graphs: Fast High-dimensional Covariance Matrix Estimation under Latent Confounding

In this work we consider the problem of estimating a high-dimensional p ...
research
03/03/2022

A Correlation Thresholding Algorithm for Learning Factor Analysis Models

Factor analysis is a widely used method for modeling a set of observed v...
research
01/22/2015

Estimating the Intrinsic Dimension of Hyperspectral Images Using an Eigen-Gap Approach

Linear mixture models are commonly used to represent hyperspectral datac...
research
06/10/2018

Determining the dimension of factor structures in non-stationary large datasets

We propose a procedure to determine the dimension of the common factor s...
research
05/31/2020

Estimation of the number of spiked eigenvalues in a covariance matrix by bulk eigenvalue matching analysis

The spiked covariance model has gained increasing popularity in high-dim...
research
01/31/2019

Determining the Dimension and Structure of the Subspace Correlated Across Multiple Data Sets

Detecting the components common or correlated across multiple data sets ...

Please sign up or login with your details

Forgot password? Click here to reset