Comparison between instrumental variable and mediation-based methods for reconstructing causal gene networks in yeast

10/14/2020
by   Adriaan-Alexander Ludl, et al.
0

Causal gene networks model the flow of information within a cell, but reconstructing them from omics data is challenging because correlation does not imply causation. Combining genomics and transcriptomics data from a segregating population allows to orient the direction of causality between gene expression traits using genomic variants. Instrumental-variable methods (IV) use a local expression quantitative trait locus (eQTL) as a randomized instrument for a gene's expression level, and assign target genes based on distal eQTL associations. Mediation-based methods (ME) additionally require that distal eQTL associations are mediated by the source gene. Here we used Findr, a software providing uniform implementations of IV, ME, and coexpression-based methods, a recent dataset of 1,012 segregants from a cross between two budding yeast strains, and the YEASTRACT database of known transcriptional interactions to compare causal gene network inference methods. We found that causal inference methods result in a significant overlap with the ground-truth, whereas coexpression did not perform better than random. A subsampling analysis revealed that the performance of ME decreases at large sample sizes, due to a loss of sensitivity when residual correlations become significant. IV methods contain false positive predictions, due to genomic linkage between eQTL instruments. IV and ME methods also have complementary roles for identifying causal genes underlying transcriptional hotspots. IV methods correctly predicted STB5 targets for a hotspot centred on the transcription factor STB5, whereas ME failed due to Stb5p auto-regulating its own expression. ME suggests a new candidate gene, DNM1, for a hotspot on Chr XII, where IV methods could not distinguish between multiple genes located within the hotspot.

READ FULL TEXT

page 7

page 17

page 18

research
10/21/2019

Hypothesis Testing in High-Dimensional Instrumental Variables Regression with an Application to Genomics Data

Gene expression and phenotype association can be affected by potential u...
research
10/18/2022

Granger causal inference on DAGs identifies genomic loci regulating transcription

When a dynamical system can be modeled as a sequence of observations, Gr...
research
09/19/2022

Inference of nonlinear causal effects with GWAS summary data

Large-scale genome-wide association studies (GWAS) have offered an excit...
research
09/05/2018

Gene Shaving using influence function of a kernel method

Identifying significant subsets of the genes, gene shaving is an essenti...
research
07/25/2022

A unified quantile framework reveals nonlinear heterogeneous transcriptome-wide associations

Transcriptome-wide association studies (TWAS) are powerful tools for ide...
research
07/28/2014

Dependence versus Conditional Dependence in Local Causal Discovery from Gene Expression Data

Motivation: Algorithms that discover variables which are causally relate...
research
06/05/2018

MRPC: An R package for accurate inference of causal graphs

We present MRPC, an R package that learns causal graphs with improved ac...

Please sign up or login with your details

Forgot password? Click here to reset