Determining the Dimension and Structure of the Subspace Correlated Across Multiple Data Sets

by   Tanuj Hasija, et al.

Detecting the components common or correlated across multiple data sets is challenging due to a large number of possible correlation structures among the components. Even more challenging is to determine the precise structure of these correlations. Traditional work has focused on determining only the model order, i.e., the dimension of the correlated subspace, a number that depends on how the model-order problem is defined. Moreover, identifying the model order is often not enough to understand the relationship among the components in different data sets. We aim at solving the complete modelselection problem, i.e., determining which components are correlated across which data sets. We prove that the eigenvalues and eigenvectors of the normalized covariance matrix of the composite data vector, under certain conditions, completely characterize the underlying correlation structure. We use these results to solve the model-selection problem by employing bootstrap-based hypothesis testing.



There are no comments yet.



Estimating Number of Factors by Adjusted Eigenvalues Thresholding

Determining the number of common factors is an important and practical t...

Simultaneous Estimation of Non-Gaussian Components and their Correlation Structure

The statistical dependencies which independent component analysis (ICA) ...

Common Information Components Analysis

We give an information-theoretic interpretation of Canonical Correlation...

Dimension-Free Anticoncentration Bounds for Gaussian Order Statistics with Discussion of Applications to Multiple Testing

The following anticoncentration property is proved. The probability that...

On the Incommensurability Phenomenon

Suppose that two large, multi-dimensional data sets are each noisy measu...

Discovering Relationships and their Structures Across Disparate Data Modalities

Determining whether certain properties are related to other properties i...

Finding Stable Groups of Cross-Correlated Features in Multi-View data

Multi-view data, in which data of different types are obtained from a co...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.