Online Principal Component Analysis in High Dimension: Which Algorithm to Choose?

11/11/2015
by Hervé Cardot, et al.

In the current context of data explosion, online techniques that do not require storing all data in memory are indispensable to routinely perform tasks like principal component analysis (PCA). Recursive algorithms that update the PCA with each new observation have been studied in various fields of research and found wide applications in industrial monitoring, computer vision, astronomy, and latent semantic indexing, among others. This work provides guidance for selecting an online PCA algorithm in practice. We present the main approaches to online PCA, namely, perturbation techniques, incremental methods, and stochastic optimization, and compare their statistical accuracy, computation time, and memory requirements using artificial and real data. Extensions to missing data and to functional data are discussed. All studied algorithms are available in the R package onlinePCA on CRAN.
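To make the idea of a recursive update concrete, here is a minimal sketch of one stochastic-optimization approach: Oja's rule for the leading principal direction. It is a classic textbook scheme chosen for illustration; it is not claimed to be the exact variant compared in the paper, and it does not use the onlinePCA package. The function names, the synthetic data stream, the direction `U`, and the step-size schedule `1 / (t + 100)` are all assumptions made for this example. The observations are assumed to be already centered.

```python
import math
import random

# Hypothetical ground truth for this demo: most of the variance lies along U.
U = [1 / math.sqrt(2), 1 / math.sqrt(2), 0.0]


def oja_update(w, x, step):
    """One Oja step: update the unit-norm estimate w with a centered observation x."""
    y = sum(wi * xi for wi, xi in zip(w, x))          # score of x on the current direction
    w = [wi + step * y * xi for wi, xi in zip(w, x)]  # stochastic gradient step
    norm = math.sqrt(sum(wi * wi for wi in w))
    return [wi / norm for wi in w]                    # renormalize to unit length


def run_demo(n=5000, seed=0):
    """Stream n synthetic observations; only w is kept in memory, never the data."""
    rng = random.Random(seed)
    w = [1.0, 0.0, 0.0]                               # arbitrary unit-norm starting point
    for t in range(1, n + 1):
        score = rng.gauss(0.0, 3.0)                   # strong signal along U
        x = [score * ui + rng.gauss(0.0, 0.3) for ui in U]  # plus small isotropic noise
        w = oja_update(w, x, step=1.0 / (t + 100))    # assumed decreasing step size
    return w


if __name__ == "__main__":
    w = run_demo()
    alignment = abs(sum(a * b for a, b in zip(w, U)))
    print("absolute cosine between estimate and U:", alignment)
```

Each update costs time and memory proportional to the dimension of a single observation, which is the property that makes such schemes attractive when the full data set cannot be stored. The sign of the estimated direction is arbitrary, so the demo reports the absolute cosine with `U`.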


