Causal Discovery and Optimal Experimental Design for Genome-Scale Biological Network Recovery

04/06/2023
by   Ashka Shah, et al.
0

Causal discovery of genome-scale networks is important for identifying pathways from genes to observable traits - e.g. differences in cell function, disease, drug resistance and others. Causal learners based on graphical models rely on interventional samples to orient edges in the network. However, these models have not been shown to scale up the size of the genome, which are on the order of 1e3-1e4 genes. We introduce a new learner, SP-GIES, that jointly learns from interventional and observational datasets and achieves almost 4x speedup against an existing learner for 1,000 node networks. SP-GIES achieves an AUC-PR score of 0.91 on 1,000 node networks, and scales up to 2,000 node networks - this is 4x larger than existing works. We also show how SP-GIES improves downstream optimal experimental design strategies for selecting interventional experiments to perform on the system. This is an important step forward in realizing causal discovery at scale via autonomous experimental design.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/03/2022

Interventions, Where and How? Experimental Design for Causal Models at Scale

Causal discovery from observational and interventional data is challengi...
research
01/03/2023

Causal Discovery for Gene Regulatory Network Prediction

Biological systems and processes are networks of complex nonlinear regul...
research
02/27/2019

ABCD-Strategy: Budgeted Experimental Design for Targeted Causal Structure Discovery

Determining the causal structure of a set of variables is critical for b...
research
03/06/2023

Boosting Differentiable Causal Discovery via Adaptive Sample Reweighting

Under stringent model type and variable distribution assumptions, differ...
research
03/08/2022

Score matching enables causal discovery of nonlinear additive noise models

This paper demonstrates how to recover causal graphs from the score of t...
research
08/24/2023

Human Comprehensible Active Learning of Genome-Scale Metabolic Networks

An important application of Synthetic Biology is the engineering of the ...
research
08/10/2018

Genome-Wide Association Studies: Information Theoretic Limits of Reliable Learning

In the problems of Genome-Wide Association Study (GWAS), the objective i...

Please sign up or login with your details

Forgot password? Click here to reset