mgcpy: A Comprehensive High Dimensional Independence Testing Python Package

07/03/2019
by   Sambit Panda, et al.
1

With the increase in the amount of data in many fields, a method to consistently and efficiently decipher relationships within high dimensional data sets is important. Because many modern datasets are high-dimensional, univariate independence tests are not applicable. While many multivariate independence tests have R packages available, the interfaces are inconsistent, most are not available in Python. mgcpy is an extensive Python library that includes many state of the art high-dimensional independence testing procedures using a common interface. The package is easy-to-use and is flexible enough to enable future extensions. This manuscript provides details for each of the tests as well as extensive power and run-time benchmarks on a suite of high-dimensional simulations previously used in different publications. The appendix includes demonstrations of how the user can interact with the package, as well as links and documentation.

READ FULL TEXT

page 1

page 10

research
07/03/2019

hyppo: A Comprehensive Multivariate Hypothesis Testing Python Package

We introduce hyppo, a unified library for performing multivariate hypoth...
research
11/11/2021

Simulating High-Dimensional Multivariate Data using the bigsimr R Package

It is critical to accurately simulate data when employing Monte Carlo te...
research
06/16/2021

Manifolds.jl: An Extensible Julia Framework for Data Analysis on Manifolds

We present the Julia package Manifolds.jl, providing a fast and easy-to-...
research
01/20/2014

A Scalable Conditional Independence Test for Nonlinear, Non-Gaussian Data

Many relations of scientific interest are nonlinear, and even in linear ...
research
11/16/2017

Predictive Independence Testing, Predictive Conditional Independence Testing, and Predictive Graphical Modelling

Testing (conditional) independence of multivariate random variables is a...
research
10/15/2018

Exploratory Mediation Analysis with Many Potential Mediators

Social and behavioral scientists are increasingly employing technologies...
research
10/11/2015

ParallelPC: an R package for efficient constraint based causal exploration

Discovering causal relationships from data is the ultimate goal of many ...

Please sign up or login with your details

Forgot password? Click here to reset