Bayesian nonparametric Principal Component Analysis

09/17/2017
by Clément Elvira, et al.

Principal component analysis (PCA) is widely used for dimension reduction. Selecting the number of significant components is essential, but it often relies on practical heuristics that depend on the application. Only a few works have proposed a probabilistic approach that can infer the number of significant components. To this end, this paper introduces a Bayesian nonparametric principal component analysis (BNP-PCA). The proposed model projects observations onto a random orthogonal basis, which is assigned a prior distribution defined on the Stiefel manifold. The prior on the factor scores involves an Indian buffet process to model the uncertainty in the number of components. The parameters of interest, as well as the nuisance parameters, are then inferred within a fully Bayesian framework via Monte Carlo sampling. The (in)consistency of the marginal maximum a posteriori estimator of the latent dimension is studied, and a new estimator of the subspace dimension is proposed. Moreover, for the sake of statistical significance, a Kolmogorov-Smirnov test based on the posterior distribution of the principal components is used to refine this estimate. The behaviour of the algorithm is first studied on various synthetic examples. Finally, the proposed BNP dimension reduction approach is shown to couple easily and efficiently with clustering or latent factor models within a single framework.
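
The two priors named in the abstract can be illustrated with a short sketch. This is not the authors' model or sampler: it only draws from a uniform distribution on the Stiefel manifold (via the QR decomposition of a Gaussian matrix) and from a standard one-parameter Indian buffet process. The function names `sample_stiefel_uniform` and `sample_ibp`, the uniform choice of Stiefel prior, and the parameter `alpha` are assumptions made for illustration; how the paper combines the two priors is not shown here.

```python
import numpy as np


def sample_stiefel_uniform(dim, k, rng):
    """Draw a dim x k matrix with orthonormal columns (assumes dim >= k).

    Uses the QR decomposition of a Gaussian matrix; the sign correction
    makes the draw uniform on the Stiefel manifold.
    """
    gauss = rng.standard_normal((dim, k))
    q, r = np.linalg.qr(gauss)
    signs = np.sign(np.diag(r))
    signs[signs == 0] = 1.0
    return q * signs  # flip each column by the sign of its diagonal entry in r


def sample_ibp(n_obs, alpha, rng):
    """Draw a binary matrix Z (n_obs x K) from an Indian buffet process.

    K, the number of active columns, is random; alpha controls how many
    columns tend to appear (hypothetical parameter name).
    """
    columns = []  # one 0/1 list of length n_obs per active column
    counts = []   # number of observations using each column so far
    for i in range(n_obs):
        # Existing columns are reused with probability count / (i + 1).
        for k in range(len(columns)):
            take = int(rng.random() < counts[k] / (i + 1))
            columns[k][i] = take
            counts[k] += take
        # New columns: Poisson(alpha / (i + 1)) of them for this observation.
        for _ in range(rng.poisson(alpha / (i + 1))):
            col = [0] * n_obs
            col[i] = 1
            columns.append(col)
            counts.append(1)
    if not columns:
        return np.zeros((n_obs, 0), dtype=int)
    return np.array(columns, dtype=int).T


rng = np.random.default_rng(0)
basis = sample_stiefel_uniform(dim=5, k=3, rng=rng)
activations = sample_ibp(n_obs=10, alpha=2.0, rng=rng)
print(np.allclose(basis.T @ basis, np.eye(3)))  # orthonormality check
print(activations.shape)  # the second dimension is random
```

The number of columns of `activations` varies from draw to draw, which is the property that lets an Indian buffet process prior express uncertainty about the number of components.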

research
08/24/2020

Torus Probabilistic Principal Component Analysis

One of the most common problems that any technique encounters is the hig...
research
05/24/2022

Bayesian Functional Principal Components Analysis using Relaxed Mutually Orthogonal Processes

Functional Principal Component Analysis (FPCA) is a prominent tool to ch...
research
08/24/2022

Discovering latent topology and geometry in data: a law of large dimension

Complex topological and geometric patterns often appear embedded in high...
research
01/27/2022

A projection based approach for interactive fixed effects panel data models

This paper presents a new approach to estimation and inference in panel ...
research
07/15/2023

Corrected kernel principal component analysis for model structural change detection

This paper develops a method to detect model structural changes by apply...
research
03/08/2017

Exact Dimensionality Selection for Bayesian PCA

We present a Bayesian model selection approach to estimate the intrinsic...
research
04/30/2021

Latent Factor Decomposition Model: Applications for Questionnaire Data

The analysis of clinical questionnaire data comes with many inherent cha...
