A Scalable Conditional Independence Test for Nonlinear, Non-Gaussian Data

01/20/2014
by   Joseph D. Ramsey, et al.
0

Many relations of scientific interest are nonlinear, and even in linear systems distributions are often non-Gaussian, for example in fMRI BOLD data. A class of search procedures for causal relations in high dimensional data relies on sample derived conditional independence decisions. The most common applications rely on Gaussian tests that can be systematically erroneous in nonlinear non-Gaussian cases. Recent work (Gretton et al. (2009), Tillman et al. (2009), Zhang et al. (2011)) has proposed conditional independence tests using Reproducing Kernel Hilbert Spaces (RKHS). Among these, perhaps the most efficient has been KCI (Kernel Conditional Independence, Zhang et al. (2011)), with computational requirements that grow effectively at least as O(N3), placing it out of range of large sample size analysis, and restricting its applicability to high dimensional data sets. We propose a class of O(N2) tests using conditional correlation independence (CCI) that require a few seconds on a standard workstation for tests that require tens of minutes to hours for the KCI method, depending on degree of parallelization, with similar accuracy. For accuracy on difficult nonlinear, non-Gaussian data sets, we also compare a recent test due to Harris & Drton (2012), applicable to nonlinear, non-Gaussian distributions in the Gaussian copula, as well as to partial correlation, a linear Gaussian test.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/17/2020

A Bayesian Nonparametric Conditional Two-sample Test with an Application to Local Causal Discovery

The performance of constraint-based causal discovery algorithms is promi...
research
05/07/2015

Effects of Nonparanormal Transform on PC and GES Search Accuracies

Liu, et al., 2009 developed a transformation of a class of non-Gaussian ...
research
07/06/2022

Comments on "Testing Conditional Independence of Discrete Distributions"

In this short note, we identify and address an error in the proof of The...
research
02/13/2017

Approximate Kernel-based Conditional Independence Tests for Fast Non-Parametric Causal Discovery

Constraint-based causal discovery (CCD) algorithms require fast and accu...
research
07/03/2019

mgcpy: A Comprehensive High Dimensional Independence Testing Python Package

With the increase in the amount of data in many fields, a method to cons...
research
06/02/2023

Theoretical Behavior of XAI Methods in the Presence of Suppressor Variables

In recent years, the community of 'explainable artificial intelligence' ...
research
10/24/2018

Notes on asymptotics of sample eigenstructure for spiked covariance models with non-Gaussian data

These expository notes serve as a reference for an accompanying post Mor...

Please sign up or login with your details

Forgot password? Click here to reset