Predictive Independence Testing, Predictive Conditional Independence Testing, and Predictive Graphical Modelling

11/16/2017
by   Samuel Burkart, et al.
0

Testing (conditional) independence of multivariate random variables is a task central to statistical inference and modelling in general - though unfortunately one for which to date there does not exist a practicable workflow. State-of-art workflows suffer from the need for heuristic or subjective manual choices, high computational complexity, or strong parametric assumptions. We address these problems by establishing a theoretical link between multivariate/conditional independence testing, and model comparison in the multivariate predictive modelling aka supervised learning task. This link allows advances in the extensively studied supervised learning workflow to be directly transferred to independence testing workflows - including automated tuning of machine learning type which addresses the need for a heuristic choice, the ability to quantitatively trade-off computational demand with accuracy, and the modern black-box philosophy for checking and interfacing. As a practical implementation of this link between the two workflows, we present a python package 'pcit', which implements our novel multivariate and conditional independence tests, interfacing the supervised learning API of the scikit-learn package. Theory and package also allow for straightforward independence test based learning of graphical model structure. We empirically show that our proposed predictive independence test outperform or are on par to current practice, and the derived graphical model structure learning algorithms asymptotically recover the 'true' graph. This paper, and the 'pcit' package accompanying it, thus provide powerful, scalable, generalizable, and easy-to-use methods for multivariate and conditional independence testing, as well as for graphical model structure learning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/02/2018

Probabilistic supervised learning

Predictive modelling and supervised learning are central to modern data ...
research
01/28/2019

Testing Conditional Predictive Independence in Supervised Learning Algorithms

We propose a general test of conditional independence. The conditional p...
research
07/01/2021

A conditional independence test for causality in econometrics

The Y-test is a useful tool for detecting missing confounders in the con...
research
07/05/2023

Conditional independence testing under model misspecification

Conditional independence (CI) testing is fundamental and challenging in ...
research
07/03/2019

hyppo: A Comprehensive Multivariate Hypothesis Testing Python Package

We introduce hyppo, a unified library for performing multivariate hypoth...
research
07/03/2019

mgcpy: A Comprehensive High Dimensional Independence Testing Python Package

With the increase in the amount of data in many fields, a method to cons...
research
01/16/2013

YGGDRASIL - A Statistical Package for Learning Split Models

There are two main objectives of this paper. The first is to present a s...

Please sign up or login with your details

Forgot password? Click here to reset