Determining the Dimension and Structure of the Subspace Correlated Across Multiple Data Sets

01/31/2019
by Tanuj Hasija, et al.

Detecting the components that are common or correlated across multiple data sets is challenging because of the large number of possible correlation structures among the components. Determining the precise structure of these correlations is even more challenging. Traditional work has focused only on determining the model order, i.e., the dimension of the correlated subspace, a number that depends on how the model-order problem is defined. Moreover, identifying the model order is often not enough to understand the relationship among the components in different data sets. We aim to solve the complete model-selection problem, i.e., to determine which components are correlated across which data sets. We prove that, under certain conditions, the eigenvalues and eigenvectors of the normalized covariance matrix of the composite data vector completely characterize the underlying correlation structure. We use these results to solve the model-selection problem by employing bootstrap-based hypothesis testing.
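
As a rough illustration of the central object in the abstract, the sketch below builds a normalized covariance matrix for a composite data vector and inspects its eigenvalues. It is not the authors' procedure: it assumes "normalized" means that each data set is whitened with its own sample covariance before stacking, it uses synthetic data in which a single component is correlated across three data sets, and it omits the bootstrap-based hypothesis tests entirely. The function names `inv_sqrt` and `coherence_matrix` and all parameter values are hypothetical.

```python
import numpy as np


def inv_sqrt(mat):
    """Inverse square root of a symmetric positive-definite matrix."""
    w, v = np.linalg.eigh(mat)
    return (v / np.sqrt(w)) @ v.T


def coherence_matrix(data_sets):
    """Normalized covariance of the stacked (composite) data vector.

    data_sets: list of arrays, each of shape (n_features, n_samples).
    Assumption: normalization = whitening each data set separately.
    """
    centered = [x - x.mean(axis=1, keepdims=True) for x in data_sets]
    n_samples = centered[0].shape[1]
    whitened = [inv_sqrt(x @ x.T / n_samples) @ x for x in centered]
    stacked = np.vstack(whitened)
    return stacked @ stacked.T / n_samples


# Synthetic example (assumed setup): 3 data sets, 3 components each,
# with only the first component correlated across all data sets.
rng = np.random.default_rng(0)
n_samples, rho = 5000, 0.8
shared = rng.standard_normal(n_samples)

data_sets = []
for _ in range(3):
    sources = rng.standard_normal((3, n_samples))
    # Pairwise correlation of this component across data sets is rho.
    sources[0] = np.sqrt(rho) * shared + np.sqrt(1 - rho) * sources[0]
    # Arbitrary (hypothetical) mixing of the sources within each data set.
    mixing = np.eye(3) + 0.3 * rng.standard_normal((3, 3))
    data_sets.append(mixing @ sources)

C = coherence_matrix(data_sets)
eigvals = np.sort(np.linalg.eigvalsh(C))[::-1]
print(np.round(eigvals, 2))
```

In this setup, uncorrelated components contribute eigenvalues close to 1, while the component shared by all three data sets pushes one eigenvalue well above 1 (about 1 + 2·rho in the population case). Deciding which eigenvalues differ significantly from 1, and reading the correlation structure from the eigenvectors, is where the hypothesis testing described in the abstract would come in.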



09/24/2019

Estimating Number of Factors by Adjusted Eigenvalues Thresholding

Determining the number of common factors is an important and practical t...
06/18/2015

Simultaneous Estimation of Non-Gaussian Components and their Correlation Structure

The statistical dependencies which independent component analysis (ICA) ...
02/03/2020

Common Information Components Analysis

We give an information-theoretic interpretation of Canonical Correlation...
07/22/2021

Dimension-Free Anticoncentration Bounds for Gaussian Order Statistics with Discussion of Applications to Multiple Testing

The following anticoncentration property is proved. The probability that...
01/09/2013

On the Incommensurability Phenomenon

Suppose that two large, multi-dimensional data sets are each noisy measu...
09/16/2016

Discovering Relationships and their Structures Across Disparate Data Modalities

Determining whether certain properties are related to other properties i...
09/10/2020

Finding Stable Groups of Cross-Correlated Features in Multi-View data

Multi-view data, in which data of different types are obtained from a co...