Deterministic parallel analysis

11/11/2017
by   Edgar Dobriban, et al.
0

Factor analysis is widely used in many application areas. The first step, choosing the number of factors, remains a serious challenge. One of the most popular methods is parallel analysis (PA), which compares the observed factor strengths to simulated ones under a noise-only model. This paper presents a deterministic version of PA (DPA), which is faster and more reproducible than PA. We show that DPA selects large factors and does not select small factors just like [Dobriban, 2017] shows for PA. Both PA and DPA are prone to a shadowing phenomenon in which a strong factor makes it hard to detect smaller but more interesting factors. We develop a deflated version of DPA (DDPA) that counters shadowing. By raising the decision threshold in DDPA, a new method (DDPA+) also improves estimation accuracy. We illustrate our methods on data from the Human Genome Diversity Project (HGDP). There PA and DPA select seemingly too many factors, while DDPA+ selects only a few. A Matlab implementation is available.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/05/2020

Selecting the number of components in PCA via random signflips

Dimensionality reduction via PCA and factor analysis is an important too...
research
07/30/2018

Guidesort: Simpler Optimal Deterministic Sorting for the Parallel Disk Model

A new algorithm, Guidesort, for sorting in the uniprocessor variant of t...
research
04/16/2019

Helping Effects Against Curse of Dimensionality in Threshold Factor Models for Matrix Time Series

As is known, factor analysis is a popular method to reduce dimension for...
research
04/11/2020

Vintage Factor Analysis with Varimax Performs Statistical Inference

Psychologists developed Multiple Factor Analysis to decompose multivaria...
research
03/26/2021

Divide-and-Conquer: A Distributed Hierarchical Factor Approach to Modeling Large-Scale Time Series Data

This paper proposes a hierarchical approximate-factor approach to analyz...
research
06/21/2020

Decoupling Shrinkage and Selection in Gaussian Linear Factor Analysis

Factor Analysis is a popular method for modeling dependence in multivari...

Please sign up or login with your details

Forgot password? Click here to reset