Priming PCA with EigenGame

09/08/2021
by   Bálint Máté, et al.
0

We introduce primed-PCA (pPCA), an extension of the recently proposed EigenGame algorithm for computing principal components in a large-scale setup. Our algorithm first runs EigenGame to get an approximation of the principal components, and then applies an exact PCA in the subspace they span. Since this subspace is of small dimension in any practical use of EigenGame, this second step is extremely cheap computationally. Nonetheless, it improves accuracy significantly for a given computational budget across datasets. In this setup, the purpose of EigenGame is to narrow down the search space, and prepare the data for the second step, an exact calculation. We show formally that pPCA improves upon EigenGame under very mild conditions, and we provide experimental validation on both synthetic and real large-scale datasets showing that it systematically translates to improved performance. In our experiments we achieve improvements in convergence speed by factors of 5-25 on the datasets of the original EigenGame paper.

READ FULL TEXT

page 6

page 9

page 10

page 11

page 13

research
02/03/2017

Intrinsic Grassmann Averages for Online Linear and Robust Subspace Learning

Principal Component Analysis (PCA) is a fundamental method for estimatin...
research
05/11/2020

Solving Large-Scale Sparse PCA to Certifiable (Near) Optimality

Sparse principal component analysis (PCA) is a popular dimensionality re...
research
11/25/2019

ROIPCA: An Online PCA algorithm based on rank-one updates

Principal components analysis (PCA) is a fundamental algorithm in data a...
research
02/02/2016

On the Nyström and Column-Sampling Methods for the Approximate Principal Components Analysis of Large Data Sets

In this paper we analyze approximate methods for undertaking a principal...
research
07/28/2019

Multi-Rank Sparse and Functional PCA: Manifold Optimization and Iterative Deflation Techniques

We consider the problem of estimating multiple principal components usin...
research
10/28/2016

Correlated-PCA: Principal Components' Analysis when Data and Noise are Correlated

Given a matrix of observed data, Principal Components Analysis (PCA) com...
research
06/18/2012

A Linear Approximation to the chi^2 Kernel with Geometric Convergence

We propose a new analytical approximation to the χ^2 kernel that converg...

Please sign up or login with your details

Forgot password? Click here to reset