Some Cautionary Comments on Principal Component Analysis for Time Series Data

08/04/2020
by   Xinyu Zhang, et al.
0

Principal component analysis (PCA) is a most frequently used statistical tool in almost all branches of data science. However, like many other statistical tools, there is sometimes the risk of misuse or even abuse. In this short note, we highlight possible pitfalls in using the theoretical results of PCA based on the assumption of independent data when the data are time series. For the latter, we state a central limit theorem of the eigenvalues and eigenvectors, give analytical and bootstrap methods to estimate the covariance, and assess their efficacy via simulation. An empirical example is given to illustrate the pitfalls of a common misuse of PCA. We conclude that while the conventional scree plot continues to be useful for time series data, the interpretation of the principal component loadings requires careful attention.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/01/2021

Sparse principal component analysis for high-dimensional stationary time series

We consider the sparse principal component analysis for high-dimensional...
research
06/26/2021

Functional Classwise Principal Component Analysis: A Novel Classification Framework

In recent times, functional data analysis (FDA) has been successfully ap...
research
04/21/2021

Principal Component Density Estimation for Scenario Generation Using Normalizing Flows

Neural networks-based learning of the distribution of non-dispatchable r...
research
01/01/2023

PCA-based Data Reduction and Signal Separation Techniques for James-Webb Space Telescope Data Processing

Principal Component Analysis (PCA)-based techniques can separate data in...
research
04/27/2022

On the Use of Dimension Reduction or Signal Separation Methods for Nitrogen River Pollution Source Identification

Identification of the current and expected future pollution sources to r...
research
02/09/2023

Evaluation of population structure inferred by principal component analysis or the admixture model

Principal component analysis (PCA) is commonly used in genetics to infer...
research
09/20/2018

Statistical analysis of astro-geodetic data through principal component analysis, linear modelling and bootstrap based inference

The paper demonstrates the application of statistical based methodology ...

Please sign up or login with your details

Forgot password? Click here to reset