ParallelPC: an R package for efficient constraint based causal exploration

10/11/2015
by   Thuc Duy Le, et al.
0

Discovering causal relationships from data is the ultimate goal of many research areas. Constraint based causal exploration algorithms, such as PC, FCI, RFCI, PC-simple, IDA and Joint-IDA have achieved significant progress and have many applications. A common problem with these methods is the high computational complexity, which hinders their applications in real world high dimensional datasets, e.g gene expression datasets. In this paper, we present an R package, ParallelPC, that includes the parallelised versions of these causal exploration algorithms. The parallelised algorithms help speed up the procedure of experimenting big datasets and reduce the memory used when running the algorithms. The package is not only suitable for super-computers or clusters, but also convenient for researchers using personal computers with multi core CPUs. Our experiment results on real world datasets show that using the parallelised algorithms it is now practical to explore causal relationships in high dimensional datasets with thousands of variables in a single multicore computer. ParallelPC is available in CRAN repository at https://cran.rproject.org/web/packages/ParallelPC/index.html.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/06/2019

Causal Discovery Toolbox: Uncover causal relationships in Python

This paper presents a new open source Python framework for causal discov...
research
10/06/2019

Boosting Local Causal Discovery in High-Dimensional Expression Data

We study how well Local Causal Discovery (LCD), a simple and efficient c...
research
11/12/2016

A Review on Algorithms for Constraint-based Causal Discovery

Causal discovery studies the problem of mining causal relationships betw...
research
08/30/2021

A practical guide to causal discovery with cohort data

In this guide, we present how to perform constraint-based causal discove...
research
03/11/2017

Learning Large-Scale Bayesian Networks with the sparsebn Package

Learning graphical models from data is an important problem with wide ap...
research
07/03/2019

mgcpy: A Comprehensive High Dimensional Independence Testing Python Package

With the increase in the amount of data in many fields, a method to cons...
research
04/05/2018

5PEN TECHNOLOGY: A New Dawn in Homogeneous and Heterogeneous Computing

This research work is a pair review into the conceptual frame work and i...

Please sign up or login with your details

Forgot password? Click here to reset