On the Incommensurability Phenomenon

01/09/2013
by   Donniell E. Fishkind, et al.
0

Suppose that two large, multi-dimensional data sets are each noisy measurements of the same underlying random process, and principle components analysis is performed separately on the data sets to reduce their dimensionality. In some circumstances it may happen that the two lower-dimensional data sets have an inordinately large Procrustean fitting-error between them. The purpose of this manuscript is to quantify this "incommensurability phenomenon." In particular, under specified conditions, the square Procrustean fitting-error of the two normalized lower-dimensional data sets is (asymptotically) a convex combination (via a correlation parameter) of the Hausdorff distance between the projection subspaces and the maximum possible value of the square Procrustean fitting-error for normalized data. We show how this gives rise to the incommensurability phenomenon, and we employ illustrative simulations as well as a real data experiment to explore how the incommensurability phenomenon may have an appreciable impact.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/14/2019

Confluent-Drawing Parallel Coordinates: Web-Based Interactive Visual Analytics of Large Multi-Dimensional Data

Parallel coordinates plot is one of the most popular and widely used vis...
research
02/03/2020

Common Information Components Analysis

We give an information-theoretic interpretation of Canonical Correlation...
research
02/18/2022

Testing the boundaries: Normalizing Flows for higher dimensional data sets

Normalizing Flows (NFs) are emerging as a powerful class of generative m...
research
01/20/2017

The biglasso Package: A Memory- and Computation-Efficient Solver for Lasso Model Fitting with Big Data in R

Penalized regression models such as the lasso have been extensively appl...
research
11/05/2022

Modeling Multi-Dimensional Datasets via a Fast Scale-Free Network Model

Compared with network datasets, multi-dimensional data are much more com...
research
01/31/2019

Determining the Dimension and Structure of the Subspace Correlated Across Multiple Data Sets

Detecting the components common or correlated across multiple data sets ...
research
03/11/2019

Fitting Tractable Convex Sets to Support Function Evaluations

The geometric problem of estimating an unknown compact convex set from e...

Please sign up or login with your details

Forgot password? Click here to reset