Sampling without replacement from a high-dimensional finite population

01/27/2023
by   Jiang Hu, et al.
0

It is well known that most of the existing theoretical results in statistics are based on the assumption that the sample is generated with replacement from an infinite population. However, in practice, available samples are almost always collected without replacement. If the population is a finite set of real numbers, whether we can still safely use the results from samples drawn without replacement becomes an important problem. In this paper, we focus on the eigenvalues of high-dimensional sample covariance matrices generated without replacement from finite populations. Specifically, we derive the Tracy-Widom laws for their largest eigenvalues and apply these results to parallel analysis. We provide new insight into the permutation methods proposed by Buja and Eyuboglu in [Multivar Behav Res. 27(4) (1992) 509–540]. Simulation and real data studies are conducted to demonstrate our results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/23/2020

Asymptotic independence of spiked eigenvalues and linear spectral statistics for large sample covariance matrices

We consider general high-dimensional spiked sample covariance models and...
research
06/08/2020

Confidence sequences for sampling without replacement

Many practical tasks involve sampling sequentially without replacement f...
research
09/22/2020

Limiting laws for extreme eigenvalues of large-dimensional spiked Fisher matrices with a divergent number of spikes

Consider the p× p matrix that is the product of a population covariance ...
research
12/06/2019

The limits of the sample spiked eigenvalues for a high-dimensional generalized Fisher matrix and its applications

A generalized spiked Fisher matrix is considered in this paper. We estab...
research
06/18/2020

Neutralizing Self-Selection Bias in Sampling for Sortition

Sortition is a political system in which decisions are made by panels of...
research
01/03/2021

Better understanding of the multivariate hypergeometric distribution with implications in design-based survey sampling

Multivariate hypergeometric distribution arises frequently in elementary...
research
01/07/2021

Parallel Hyperedge Replacement Grammars

In 2018, it was shown that all finitely generated virtually Abelian grou...

Please sign up or login with your details

Forgot password? Click here to reset