Perturbed factor analysis: Improving generalizability across studies

10/07/2019
by   Arkaprava Roy, et al.
0

Factor analysis is routinely used for dimensionality reduction. However, a major issue is `brittleness' in which one can obtain substantially different factors in analyzing similar datasets. Factor models have been developed for multi-study data by using additive expansions incorporating common and study-specific factors. However, allowing study-specific factors runs counter to the goal of producing a single set of factors that hold across studies. As an alternative, we propose a class of Perturbed Factor Analysis (PFA) models that assume a common factor structure across studies after perturbing the data via multiplication by a study-specific matrix. Bayesian inference algorithms can be easily modified in this case by using a matrix normal hierarchical model for the perturbation matrices. The resulting model is just as flexible as current approaches in allowing arbitrarily large differences across studies, but has substantial advantages that we illustrate in simulation studies and an application to NHANES data. We additionally show advantages of PFA in single study data analyses in which we assign each individual their own perturbation matrix, including reduced generalization error and improved identifiability.

READ FULL TEXT

page 5

page 12

page 14

page 15

page 18

page 19

page 20

page 22

research
07/24/2020

Bayesian Combinatorial Multi-Study Factor Analysis

Analyzing multiple studies allows leveraging data from a range of source...
research
10/09/2019

Bayesian factor models for multivariate categorical data obtained from questionnaires

Factor analysis is a flexible technique for assessment of multivariate d...
research
06/26/2018

Bayesian Multi-study Factor Analysis for High-throughput Biological Data

This paper presents a new modeling strategy for joint unsupervised analy...
research
01/28/2020

On the Dimensional Indeterminacy of One-Wave Factor Analysis Under Causal Effects

It is shown, with two sets of survey items that separately load on two d...
research
01/28/2022

R-factor analysis of data generated by a combination of R- and Q-factors leads to biased loading estimates

The effect of combined, generating R- and Q-factors of measured variable...
research
03/11/2016

Matrix factoring by fraction-free reduction

We consider exact matrix decomposition by Gauss-Bareiss reduction. We in...
research
08/12/2020

A Bayesian Approach to Spherical Factor Analysis for Binary Data

Factor models are widely used across diverse areas of application for pu...

Please sign up or login with your details

Forgot password? Click here to reset