Network Maximal Correlation

06/15/2016
by   Soheil Feizi, et al.
0

We introduce Network Maximal Correlation (NMC) as a multivariate measure of nonlinear association among random variables. NMC is defined via an optimization that infers transformations of variables by maximizing aggregate inner products between transformed variables. For finite discrete and jointly Gaussian random variables, we characterize a solution of the NMC optimization using basis expansion of functions over appropriate basis functions. For finite discrete variables, we propose an algorithm based on alternating conditional expectation to determine NMC. Moreover we propose a distributed algorithm to compute an approximation of NMC for large and dense graphs using graph partitioning. For finite discrete variables, we show that the probability of discrepancy greater than any given level between NMC and NMC computed using empirical distributions decays exponentially fast as the sample size grows. For jointly Gaussian variables, we show that under some conditions the NMC optimization is an instance of the Max-Cut problem. We then illustrate an application of NMC in inference of graphical model for bijective functions of jointly Gaussian variables. Finally, we show NMC's utility in a data application of learning nonlinear dependencies among genes in a cancer dataset.

READ FULL TEXT

page 10

page 11

research
04/29/2019

Extreme Nonlinear Correlation for Multiple Random Variables and Stochastic Processes with Applications to Additive Models

The maximum correlation of functions of a pair of random variables is an...
research
07/05/2021

Sets of Marginals and Pearson-Correlation-based CHSH Inequalities for a Two-Qubit System

Quantum mass functions (QMFs), which are tightly related to decoherence ...
research
09/06/2021

Bounding Means of Discrete Distributions

We introduce methods to bound the mean of a discrete distribution (or fi...
research
10/08/2019

An Information-theoretic Approach to Unsupervised Feature Selection for High-Dimensional Data

In this paper, we propose an information-theoretic approach to design th...
research
04/16/2023

Pointwise Maximal Leakage on General Alphabets

Pointwise maximal leakage (PML) is an operationally meaningful privacy m...
research
02/10/2010

A Generalization of the Chow-Liu Algorithm and its Application to Statistical Learning

We extend the Chow-Liu algorithm for general random variables while the ...
research
04/23/2014

Probabilistic graphs using coupled random variables

Neural network design has utilized flexible nonlinear processes which ca...

Please sign up or login with your details

Forgot password? Click here to reset