Sifting Common Information from Many Variables

06/07/2016
by   Greg Ver Steeg, et al.
0

Measuring the relationship between any pair of variables is a rich and active area of research that is central to scientific practice. In contrast, characterizing the common information among any group of variables is typically a theoretical exercise with few practical methods for high-dimensional data. A promising solution would be a multivariate generalization of the famous Wyner common information, but this approach relies on solving an apparently intractable optimization problem. We leverage the recently introduced information sieve decomposition to formulate an incremental version of the common information problem that admits a simple fixed point solution, fast convergence, and complexity that is linear in the number of variables. This scalable approach allows us to demonstrate the usefulness of common information in high-dimensional learning problems. The sieve outperforms standard methods on dimensionality reduction tasks, solves a blind source separation problem that cannot be solved with ICA, and accurately recovers structure in brain imaging data.

READ FULL TEXT
research
06/04/2023

Prescriptive PCA: Dimensionality Reduction for Two-stage Stochastic Optimization

In this paper, we consider the alignment between an upstream dimensional...
research
02/18/2022

Incorporating Texture Information into Dimensionality Reduction for High-Dimensional Images

High-dimensional imaging is becoming increasingly relevant in many field...
research
05/23/2019

Unsupervised Discovery of Temporal Structure in Noisy Data with Dynamical Components Analysis

Linear dimensionality reduction methods are commonly used to extract low...
research
09/26/2013

Inverse Covariance Estimation for High-Dimensional Data in Linear Time and Space: Spectral Methods for Riccati and Sparse Models

We propose maximum likelihood estimation for learning Gaussian graphical...
research
07/29/2015

Fast Robust PCA on Graphs

Mining useful clusters from high dimensional data has received significa...
research
06/22/2020

Latent feature sharing: an adaptive approach to linear decomposition models

Latent feature models are canonical tools for exploratory analysis in cl...
research
10/27/2014

Maximally Informative Hierarchical Representations of High-Dimensional Data

We consider a set of probabilistic functions of some input variables as ...

Please sign up or login with your details

Forgot password? Click here to reset