Quantifying the Estimation Error of Principal Components

10/27/2017
by   Raphael Hauser, et al.
0

Principal component analysis is an important pattern recognition and dimensionality reduction tool in many applications. Principal components are computed as eigenvectors of a maximum likelihood covariance Σ that approximates a population covariance Σ, and these eigenvectors are often used to extract structural information about the variables (or attributes) of the studied population. Since PCA is based on the eigendecomposition of the proxy covariance Σ rather than the ground-truth Σ, it is important to understand the approximation error in each individual eigenvector as a function of the number of available samples. The recent results of Kolchinskii and Lounici yield such bounds. In the present paper we sharpen these bounds and show that eigenvectors can often be reconstructed to a required accuracy from a sample of strictly smaller size order.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/29/2022

MML Probabilistic Principal Component Analysis

Principal component analysis (PCA) is perhaps the most widely method for...
research
11/07/2018

A note on the prediction error of principal component regression

We analyse the prediction error of principal component regression (PCR) ...
research
01/05/2021

A Linearly Convergent Algorithm for Distributed Principal Component Analysis

Principal Component Analysis (PCA) is the workhorse tool for dimensional...
research
09/10/2021

Principal component analysis for high-dimensional compositional data

Dimension reduction for high-dimensional compositional data plays an imp...
research
10/22/2014

Demixed principal component analysis of population activity in higher cortical areas reveals independent representation of task parameters

Neurons in higher cortical areas, such as the prefrontal cortex, are kno...
research
07/28/2023

Stratified Principal Component Analysis

This paper investigates a general family of models that stratifies the s...
research
12/23/2019

Quantifying the Effects of the 2008 Recession using the Zillow Dataset

This report explores the use of Zillow's housing metrics dataset to inve...

Please sign up or login with your details

Forgot password? Click here to reset