Validation of nonlinear PCA

04/03/2012
by   Matthias Scholz, et al.
0

Linear principal component analysis (PCA) can be extended to a nonlinear PCA by using artificial neural networks. But the benefit of curved components requires a careful control of the model complexity. Moreover, standard techniques for model selection, including cross-validation and more generally the use of an independent test set, fail when applied to nonlinear PCA because of its inherent unsupervised characteristics. This paper presents a new approach for validating the complexity of nonlinear PCA models by using the error in missing data estimation as a criterion for model selection. It is motivated by the idea that only the model of optimal complexity is able to predict missing values with the highest accuracy. While standard test set validation usually favours over-fitted nonlinear PCA models, the proposed model validation approach correctly selects the optimal model complexity.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/08/2019

Automatic dimensionality selection for principal component analysis models with the ignorance score

Principal component analysis (PCA) is by far the most widespread tool fo...
research
12/31/2018

The Stochastic Complexity of Principal Component Analysis

PCA (principal component analysis) and its variants are ubiquitous techn...
research
02/26/2023

Efficient fair PCA for fair representation learning

We revisit the problem of fair principal component analysis (PCA), where...
research
11/15/2020

Deep-RLS: A Model-Inspired Deep Learning Approach to Nonlinear PCA

In this work, we consider the application of model-based deep learning i...
research
02/25/2019

Logistic principal component analysis via non-convex singular value thresholding

Multivariate binary data is becoming abundant in current biological rese...
research
10/07/2021

AgFlow: Fast Model Selection of Penalized PCA via Implicit Regularization Effects of Gradient Flow

Principal component analysis (PCA) has been widely used as an effective ...
research
01/18/2023

Data thinning for convolution-closed distributions

We propose data thinning, a new approach for splitting an observation in...

Please sign up or login with your details

Forgot password? Click here to reset