Testing Causality for High Dimensional Data

03/14/2023
by   Arun Jambulapati, et al.
0

Determining causal relationship between high dimensional observations are among the most important tasks in scientific discoveries. In this paper, we revisited the linear trace method, a technique proposed in <cit.> to infer the causal direction between two random variables of high dimensions. We strengthen the existing results significantly by providing an improved tail analysis in addition to extending the results to nonlinear trace functionals with sharper confidence bounds under certain distributional assumptions. We obtain our results by interpreting the trace estimator in the causal regime as a function over random orthogonal matrices, where the concentration of Lipschitz functions over such space could be applied. We additionally propose a novel ridge-regularized variant of the estimator in <cit.>, and give provable bounds relating the ridge-estimated terms to their ground-truth counterparts. We support our theoretical results with encouraging experiments on synthetic datasets, more prominently, under high-dimension low sample size regime.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/14/2012

Testing whether linear equations are causal: A free probability theory approach

We propose a method that infers whether linear relations between two hig...
research
01/02/2020

Modified Pillai's trace statistics for two high-dimensional sample covariance matrices

The goal of this study was to test the equality of two covariance matric...
research
07/05/2016

Risk Bounds for High-dimensional Ridge Function Combinations Including Neural Networks

Let f^ be a function on R^d satisfying a spectral norm condition. Fo...
research
12/09/2022

Deep Learning of Causal Structures in High Dimensions

Recent years have seen rapid progress at the intersection between causal...
research
05/28/2020

Ridge TRACE Diagnostics

We describe a new p-parameter generalized ridge-regression shrinkage-pat...
research
07/25/2020

A finite sample analysis of the double descent phenomenon for ridge function estimation

Recent extensive numerical experiments in high scale machine learning ha...
research
09/14/2015

Markov Boundary Discovery with Ridge Regularized Linear Models

Ridge regularized linear models (RRLMs), such as ridge regression and th...

Please sign up or login with your details

Forgot password? Click here to reset