Resampling Sensitivity of High-Dimensional PCA

12/30/2022 ∙ by Haoyu Wang, et al.

The stability and sensitivity of statistical methods and algorithms with respect to their input data is an important problem in machine learning and statistics. An algorithm's performance under resampling of the data is a fundamental measure of its stability and is closely related to the generalization or privacy of the algorithm. In this paper, we study the resampling sensitivity of principal component analysis (PCA). Given an n × p random matrix 𝐗, let 𝐗^[k] be the matrix obtained from 𝐗 by resampling k randomly chosen entries of 𝐗. Let 𝐯 and 𝐯^[k] denote the principal components of 𝐗 and 𝐗^[k]. In the proportional growth regime p/n → ξ ∈ (0,1], we establish the sharp threshold for the sensitivity/stability transition of PCA. When k ≫ n^{5/3}, the principal components 𝐯 and 𝐯^[k] are asymptotically orthogonal. On the other hand, when k ≪ n^{5/3}, they are asymptotically collinear. In other words, we show that PCA is sensitive to the input data in the sense that resampling even a negligible portion of the input may completely change the output.
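The resampling experiment described in the abstract can be sketched numerically. The following is a minimal illustration, not the authors' code. It assumes i.i.d. standard Gaussian entries, takes the principal component to be the leading right singular vector of 𝐗, and measures alignment by |⟨𝐯, 𝐯^[k]⟩|. The function names are hypothetical.

```python
import numpy as np

def top_principal_component(X):
    # Leading right singular vector of X, i.e. the top eigenvector of X^T X
    # (assumption: "principal component" means this unit vector).
    _, _, vt = np.linalg.svd(X, full_matrices=False)
    return vt[0]

def resample_entries(X, k, rng):
    # Replace k distinct, uniformly chosen entries of X with fresh
    # independent standard Gaussian draws (assumed entry distribution).
    Xk = X.copy()
    idx = rng.permutation(X.size)[:k]
    rows, cols = np.unravel_index(idx, X.shape)
    Xk[rows, cols] = rng.standard_normal(k)
    return Xk

def overlap(n, p, k, seed=0):
    # |<v, v^[k]>|: near 1 means nearly collinear, near 0 nearly orthogonal.
    rng = np.random.default_rng(seed)
    X = rng.standard_normal((n, p))
    Xk = resample_entries(X, k, rng)
    v = top_principal_component(X)
    vk = top_principal_component(Xk)
    return abs(float(v @ vk))
```

Calling `overlap(n, p, k)` for values of k well below and well above n^{5/3} (with p/n fixed) gives a rough picture of the transition. The result is asymptotic, so small-n simulations are only suggestive and should be averaged over several seeds.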

research
∙ 10/29/2019

A Generalization of Principal Component Analysis

Conventional principal component analysis (PCA) finds a principal vector...
research
∙ 05/20/2020

Functional principal component analysis for global sensitivity analysis of model with spatial output

Motivated by risk assessment of coastal flooding, we consider time-consu...
research
∙ 12/21/2014

Principal Sensitivity Analysis

We present a novel algorithm (Principal Sensitivity Analysis; PSA) to an...
research
∙ 10/28/2016

Correlated-PCA: Principal Components' Analysis when Data and Noise are Correlated

Given a matrix of observed data, Principal Components Analysis (PCA) com...
research
∙ 11/19/2019

Discussion contribution "Functional models for time-varying random objects" by Dubey and Müller (to appear in JRSS-B)

In an inspiring paper Dubey and Müller (DM) extend PCA to the case that ...
research
∙ 12/17/2021

Joint machine learning analysis of muon spectroscopy data from different materials

Machine learning (ML) methods have proved to be a very successful tool i...
research
∙ 08/09/2022

Sensitivity of principal components to changes in the presence of non-stationarity

Non-stationarity affects the sensitivity of change detection in correlat...
