Determining the Dimension and Structure of the Subspace Correlated Across Multiple Data Sets

01/31/2019
by   Tanuj Hasija, et al.
0

Detecting the components common or correlated across multiple data sets is challenging due to a large number of possible correlation structures among the components. Even more challenging is to determine the precise structure of these correlations. Traditional work has focused on determining only the model order, i.e., the dimension of the correlated subspace, a number that depends on how the model-order problem is defined. Moreover, identifying the model order is often not enough to understand the relationship among the components in different data sets. We aim at solving the complete modelselection problem, i.e., determining which components are correlated across which data sets. We prove that the eigenvalues and eigenvectors of the normalized covariance matrix of the composite data vector, under certain conditions, completely characterize the underlying correlation structure. We use these results to solve the model-selection problem by employing bootstrap-based hypothesis testing.

READ FULL TEXT
research
05/30/2023

Identifying the Complete Correlation Structure in Large-Scale High-Dimensional Data Sets with Local False Discovery Rates

The identification of the dependent components in multiple data sets is ...
research
09/24/2019

Estimating Number of Factors by Adjusted Eigenvalues Thresholding

Determining the number of common factors is an important and practical t...
research
06/18/2015

Simultaneous Estimation of Non-Gaussian Components and their Correlation Structure

The statistical dependencies which independent component analysis (ICA) ...
research
02/03/2020

Common Information Components Analysis

We give an information-theoretic interpretation of Canonical Correlation...
research
07/22/2021

Dimension-Free Anticoncentration Bounds for Gaussian Order Statistics with Discussion of Applications to Multiple Testing

The following anticoncentration property is proved. The probability that...
research
01/09/2013

On the Incommensurability Phenomenon

Suppose that two large, multi-dimensional data sets are each noisy measu...
research
09/10/2020

Finding Stable Groups of Cross-Correlated Features in Multi-View data

Multi-view data, in which data of different types are obtained from a co...

Please sign up or login with your details

Forgot password? Click here to reset