Improving the Accuracy of Principal Component Analysis by the Maximum Entropy Method

07/24/2019
by Guihong Wan, et al.
Classical Principal Component Analysis (PCA) approximates data by projecting it onto a small number of orthogonal vectors. Simple procedures exist to efficiently compute various functions of the data from the PCA approximation. Arguably the most important of these functions is the Euclidean distance between data items, which can be used, for example, to solve the approximate nearest neighbor problem. We use random variables to model the inherent uncertainty in such approximations and apply the Maximum Entropy Method to infer the underlying probability distribution. We propose using the expected values of distances between these random variables as improved estimates of the distance. We show, both analytically and experimentally, that in most cases the results obtained by our method are more accurate than those obtained by the classical approach. This improves the accuracy of a classical technique that has been used with little change for over 100 years.
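To make the classical distance estimate concrete, the sketch below computes Euclidean distances from PCA coordinates with NumPy and compares them with the true distance. Because projection onto orthonormal vectors can only shrink distances, the classical estimate never exceeds the true value. The `residual_aware_distance` function is an illustrative assumption and not the formula derived in the paper: it treats the discarded residuals as independent, zero-mean random vectors of known norm, in which case the expected squared distance is the projected squared distance plus the two squared residual norms. All function names and the synthetic data are hypothetical.

```python
import numpy as np

def fit_pca(X, k):
    """Return the data mean and the top-k principal directions (as rows)."""
    mean = X.mean(axis=0)
    # Rows of Vt are orthonormal principal directions, sorted by variance.
    _, _, Vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, Vt[:k]

def encode(X, mean, components):
    """Return PCA coordinates and the squared norm of each discarded residual."""
    centered = X - mean
    coords = centered @ components.T
    # Pythagoras: squared residual = total squared norm - retained squared norm.
    resid_sq = (centered ** 2).sum(axis=1) - (coords ** 2).sum(axis=1)
    return coords, np.maximum(resid_sq, 0.0)

def classical_distance(ci, cj):
    """Classical PCA estimate: distance between the projected points."""
    return float(np.linalg.norm(ci - cj))

def residual_aware_distance(ci, ri_sq, cj, rj_sq):
    """Illustrative assumption (not the paper's formula): add the stored
    squared residual norms, as if the residuals were independent and zero-mean."""
    return float(np.sqrt(np.sum((ci - cj) ** 2) + ri_sq + rj_sq))

# Synthetic data with decaying variance per dimension (hypothetical example).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 20)) * np.linspace(3.0, 0.5, 20)

mean, comps = fit_pca(X, k=5)
coords, resid_sq = encode(X, mean, comps)

true_d = float(np.linalg.norm(X[0] - X[1]))
classical_d = classical_distance(coords[0], coords[1])
adjusted_d = residual_aware_distance(coords[0], resid_sq[0], coords[1], resid_sq[1])

print(f"true={true_d:.3f} classical={classical_d:.3f} adjusted={adjusted_d:.3f}")
```

Storing one extra scalar per item (the squared residual norm) is what makes the adjusted estimate possible here; whether it is closer to the true distance for a particular pair depends on the data.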


research
03/14/2023

Informational Rescaling of PCA Maps with Application to Genetic Distance

We discuss the inadequacy of covariances/correlations and other measures...
research
08/13/2019

Principal symmetric space analysis

We develop a novel analogue of Euclidean PCA (principal component analys...
research
12/02/2015

Optimal whitening and decorrelation

Whitening, or sphering, is a common preprocessing step in statistical an...
research
09/29/2013

Rotationally Invariant Image Representation for Viewing Direction Classification in Cryo-EM

We introduce a new rotationally invariant viewing angle classification m...
research
07/23/2019

NPSA: Nonorthogonal Principal Skewness Analysis

Principal skewness analysis (PSA) has been introduced for feature extrac...
research
10/16/2017

Geometric Learning and Filtering in Finance

We develop a method for incorporating relevant non-Euclidean geometric i...
research
07/15/2019

Robust Nonlinear Component Estimation with Tikhonov Regularization

Learning reduced component representations of data using nonlinear trans...
