High Dimensional Semiparametric Scale-Invariant Principal Component Analysis

02/18/2014
by   Fang Han, et al.
0

We propose a new high dimensional semiparametric principal component analysis (PCA) method, named Copula Component Analysis (COCA). The semiparametric model assumes that, after unspecified marginally monotone transformations, the distributions are multivariate Gaussian. COCA improves upon PCA and sparse PCA in three aspects: (i) It is robust to modeling assumptions; (ii) It is robust to outliers and data contamination; (iii) It is scale-invariant and yields more interpretable results. We prove that the COCA estimators obtain fast estimation rates and are feature selection consistent when the dimension is nearly exponentially large relative to the sample size. Careful experiments confirm that COCA outperforms sparse PCA on both synthetic and real-world datasets.

READ FULL TEXT

page 26

page 27

page 29

research
11/06/2022

Cauchy robust principal component analysis with applications to high-deimensional data sets

Principal component analysis (PCA) is a standard dimensionality reductio...
research
07/28/2016

Asymptotic properties of Principal Component Analysis and shrinkage-bias adjustment under the Generalized Spiked Population model

With the development of high-throughput technologies, principal componen...
research
06/15/2018

Sparse Principal Component based High-Dimensional Mediation Analysis

Causal mediation analysis aims to quantify the intermediate effect of a ...
research
12/21/2013

Large-Scale Paralleled Sparse Principal Component Analysis

Principal component analysis (PCA) is a statistical technique commonly u...
research
03/09/2022

High Dimensional Statistical Analysis and its Application to ALMA Map of NGC 253

In astronomy, if we denote the dimension of data as d and the number of ...
research
07/15/2021

Principal component analysis for Gaussian process posteriors

This paper proposes an extension of principal component analysis for Gau...
research
10/26/2012

Large-Scale Sparse Principal Component Analysis with Application to Text Data

Sparse PCA provides a linear combination of small number of features tha...

Please sign up or login with your details

Forgot password? Click here to reset