Principal component analysis for high-dimensional compositional data

09/10/2021
by Jingru Zhang, et al.

Dimension reduction for high-dimensional compositional data plays an important role in many fields, where the principal component analysis of the basis covariance matrix is of scientific interest. In practice, however, the basis variables are latent and rarely observed, and standard principal component analysis techniques are inadequate for compositional data because of the simplex constraint. To address this challenging problem, we relate the principal subspace of the centered log-ratio compositional covariance to that of the basis covariance, and prove that the latter is approximately identifiable as the dimensionality diverges under a subspace sparsity assumption. This blessing-of-dimensionality phenomenon enables us to propose principal subspace estimation methods based on the sample centered log-ratio covariance. We also derive nonasymptotic error bounds for the subspace estimators, which exhibit a tradeoff between identification and estimation. Moreover, we develop efficient proximal alternating direction method of multipliers algorithms to solve the resulting nonconvex and nonsmooth optimization problems. Simulation results demonstrate that the proposed methods perform as well as the oracle methods with known basis. Their usefulness is illustrated through an analysis of word usage patterns for statisticians.
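
As a rough illustration of the starting point described above, the following sketch applies the centered log-ratio (clr) transform to compositions and extracts the leading eigenvectors of the sample clr covariance. It is a minimal sketch under stated assumptions, not the estimators proposed in the paper: it uses a plain eigendecomposition with no subspace sparsity penalty and no proximal ADMM solver, and the function names (`clr`, `clr_principal_subspace`) and the simulated low-rank log-basis model are hypothetical choices made only for this example.

```python
import numpy as np


def clr(compositions):
    """Centered log-ratio transform of each row (entries must be strictly positive)."""
    logs = np.log(compositions)
    return logs - logs.mean(axis=1, keepdims=True)


def clr_principal_subspace(compositions, rank):
    """Leading eigenvectors and eigenvalues of the sample clr covariance.

    Plain (non-sparse) PCA; a stand-in for illustration only.
    """
    z = clr(compositions)
    cov = np.cov(z, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)  # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1][:rank]
    return eigvecs[:, order], eigvals[order]


# Simulated data (assumed model): a latent log-basis with low-rank structure plus noise.
rng = np.random.default_rng(0)
n, p, r = 200, 50, 2
loadings = rng.normal(size=(p, r))
scores = 3.0 * rng.normal(size=(n, r))
log_basis = scores @ loadings.T + 0.1 * rng.normal(size=(n, p))
basis = np.exp(log_basis)

# Only the compositions (basis normalized to the simplex) are observed in practice.
compositions = basis / basis.sum(axis=1, keepdims=True)

subspace, variances = clr_principal_subspace(compositions, r)
```

Because the clr transform removes any per-sample scaling, `clr(compositions)` and `clr(basis)` coincide, which is why the clr covariance can be computed from compositions alone even though the basis is latent.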

research
04/20/2020

Robust Covariance Estimation for High-dimensional Compositional Data with Application to Microbial Communities Analysis

Microbial communities analysis is drawing growing attention due to the r...
research
12/20/2017

Independent component analysis for multivariate functional data

We extend two methods of independent component analysis, fourth order bl...
research
09/20/2023

Principal component analysis in Bayes spaces for sparsely sampled density functions

This paper presents a novel approach to functional principal component a...
research
03/05/2021

Density ratio model with data-adaptive basis function

In many applications, we collect independent samples from interconnected...
research
12/03/2019

A Fast deflation Method for Sparse Principal Component Analysis via Subspace Projections

Deflation method is an iterative technique that searches the sparse load...
research
10/11/2020

Infrared target tracking based on proximal robust principal component analysis method

Infrared target tracking plays an important role in both civil and milit...
research
10/27/2017

Quantifying the Estimation Error of Principal Components

Principal component analysis is an important pattern recognition and dim...
