Evaluation of population structure inferred by principal component analysis or the admixture model

02/09/2023
by   Jan van Waaij, et al.
0

Principal component analysis (PCA) is commonly used in genetics to infer and visualize population structure and admixture between populations. PCA is often interpreted in a way similar to inferred admixture proportions, where it is assumed that individuals belong to one of several possible populations or are admixed between these populations. We propose a new method to assess the statistical fit of PCA (interpreted as a model spanned by the top principal components) and to show that violations of the PCA assumptions affect the fit. Our method uses the chosen top principal components to predict the genotypes. By assessing the covariance (and the correlation) of the residuals (the differences between observed and predicted genotypes), we are able to detect violation of the model assumptions. Based on simulations and genome wide human data we show that our assessment of fit can be used to guide the interpretation of the data and to pinpoint individuals that are not well represented by the chosen principal components. Our method works equally on other similar models, such as the admixture model, where the mean of the data is represented by linear matrix decomposition.

READ FULL TEXT

page 6

page 8

page 9

page 10

research
10/16/2021

Principal Component Analysis versus Factor Analysis

The article discusses selected problems related to both principal compon...
research
02/14/2023

On the Multiway Principal Component Analysis

Multiway data are becoming more and more common. While there are many ap...
research
08/04/2020

Some Cautionary Comments on Principal Component Analysis for Time Series Data

Principal component analysis (PCA) is a most frequently used statistical...
research
11/11/2022

Deep equilibrium models as estimators for continuous latent variables

Principal Component Analysis (PCA) and its exponential family extensions...
research
11/10/2017

New Interpretation of Principal Components Analysis

A new look on the principal component analysis has been presented. First...
research
12/02/2015

Optimal whitening and decorrelation

Whitening, or sphering, is a common preprocessing step in statistical an...
research
06/08/2020

Schrödinger PCA: You Only Need Variances for Eigenmodes

Principal component analysis (PCA) has achieved great success in unsuper...

Please sign up or login with your details

Forgot password? Click here to reset