Learning Causality: Synthesis of Large-Scale Causal Networks from High-Dimensional Time Series Data

05/06/2019
by   Mark-Oliver Stehr, et al.
0

There is an abundance of complex dynamic systems that are critical to our daily lives and our society but that are hardly understood, and even with today's possibilities to sense and collect large amounts of experimental data, they are so complex and continuously evolving that it is unlikely that their dynamics will ever be understood in full detail. Nevertheless, through computational tools we can try to make the best possible use of the current technologies and available data. We believe that the most useful models will have to take into account the imbalance between system complexity and available data in the context of limited knowledge or multiple hypotheses. The complex system of biological cells is a prime example of such a system that is studied in systems biology and has motivated the methods presented in this paper. They were developed as part of the DARPA Rapid Threat Assessment (RTA) program, which is concerned with understanding of the mechanism of action (MoA) of toxins or drugs affecting human cells. Using a combination of Gaussian processes and abstract network modeling, we present three fundamentally different machine-learning-based approaches to learn causal relations and synthesize causal networks from high-dimensional time series data. While other types of data are available and have been analyzed and integrated in our RTA work, we focus on transcriptomics (that is gene expression) data obtained from high-throughput microarray experiments in this paper to illustrate capabilities and limitations of our algorithms. Our algorithms make different but overall relatively few biological assumptions, so that they are applicable to other types of biological data and potentially even to other complex systems that exhibit high dimensionality but are not of biological nature.

READ FULL TEXT
research
08/21/2012

Learning LiNGAM based on data with more variables than observations

A very important topic in systems biology is developing statistical meth...
research
03/15/2021

SEMgraph: An R Package for Causal Network Analysis of High-Throughput Data with Structural Equation Models

With the advent of high-throughput sequencing (HTS) in molecular biology...
research
05/04/2018

Causal Queries from Observational Data in Biological Systems via Bayesian Networks: An Empirical Study in Small Networks

Biological networks are a very convenient modelling and visualisation to...
research
04/28/2018

Fuzzy logic based approaches for gene regulatory network inference

The rapid advancement in high-throughput techniques has fueled the gener...
research
05/05/2021

Granger Causality: A Review and Recent Advances

Introduced more than a half century ago, Granger causality has become a ...
research
11/11/2015

Instantaneous Modelling and Reverse Engineering of DataConsistent Prime Models in Seconds!

A theoretical framework that supports automated construction of dynamic ...
research
10/06/2020

Gene Regulatory Network Inference with Latent Force Models

Delays in protein synthesis cause a confounding effect when constructing...

Please sign up or login with your details

Forgot password? Click here to reset