Informational Rescaling of PCA Maps with Application to Genetic Distance

03/14/2023
by   Nassim Nicholas Taleb, et al.
0

We discuss the inadequacy of covariances/correlations and other measures in L-2 as relative distance metrics. We propose a computationally simple heuristic to transform a map based on standard principal component analysis (PCA) (when the variables are asymptotically Gaussian) into an entropy-based map where distances are based on mutual information (MI).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/22/2019

Principal Component Analysis: A Generalized Gini Approach

A principal component analysis based on the generalized Gini correlation...
research
07/24/2019

Improving the Accuracy of Principal Component Analysis by the Maximum Entropy Method

Classical Principal Component Analysis (PCA) approximates data in terms ...
research
02/09/2019

Optimal Latent Representations: Distilling Mutual Information into Principal Pairs

Principal component analysis (PCA) is generalized from one to two random...
research
05/08/2023

High-Dimensional Smoothed Entropy Estimation via Dimensionality Reduction

We study the problem of overcoming exponential sample complexity in diff...
research
12/21/2014

Correlation of Data Reconstruction Error and Shrinkages in Pair-wise Distances under Principal Component Analysis (PCA)

In this on-going work, I explore certain theoretical and empirical impli...
research
06/20/2021

Distributed Picard Iteration: Application to Distributed EM and Distributed PCA

In recent work, we proposed a distributed Picard iteration (DPI) that al...
research
09/06/2021

Making drawings speak through mathematical metrics

Figurative drawing is a skill that takes time to learn, and evolves duri...

Please sign up or login with your details

Forgot password? Click here to reset