Vine dependence graphs with latent variables as summaries for gene expression data

03/02/2023
by Xinyao Fan, et al.

The advent of high-throughput sequencing technologies has led to vast comparative genome sequences. The construction of gene-gene interaction networks or dependence graphs on the genome scale is vital for understanding the regulation of biological processes. Different dependence graphs can provide different information. Some existing methods build dependence graphs from high-order partial correlations; the resulting graphs are sparse and not informative when there are latent variables that can explain much of the dependence in groups of genes. Other methods, based on correlations and first-order partial correlations, might produce dense graphs. For the case where genes can be divided into groups with stronger within-group dependence in gene expression than between-group dependence, we present a dependence graph based on truncated vines with latent variables that makes use of group information and low-order partial correlations. The graphs are not dense, and the genes that might be more central have more neighbors in the vine dependence graph. We demonstrate the use of our dependence graph construction on two RNA-seq data sets: yeast and prostate cancer. There is some biological evidence to support the relationships between genes in the resulting dependence graphs. A flexible framework is provided for building dependence graphs via low-order partial correlations and formation of groups, leading to graphs that are neither too sparse nor too dense. We anticipate that this approach will help to identify groups that might be central to different biological functions.
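
To make the role of latent variables and low-order partial correlations concrete, here is a minimal sketch in Python. It is not the authors' truncated vine construction; it only illustrates the standard first-order partial correlation formula on hypothetical toy data, where two simulated "genes" share one latent factor. The names `gene_a`, `gene_b`, `latent`, and the loadings 0.8 and 0.7 are assumptions chosen for illustration.

```python
import math
import random


def pearson(x, y):
    """Sample Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def partial_corr(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y given z."""
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# Hypothetical toy data: two "genes" driven by one shared latent factor.
rng = random.Random(0)
n = 5000
latent = [rng.gauss(0, 1) for _ in range(n)]
gene_a = [0.8 * z + 0.6 * rng.gauss(0, 1) for z in latent]
gene_b = [0.7 * z + math.sqrt(1 - 0.7 ** 2) * rng.gauss(0, 1) for z in latent]

# Marginal correlation between the two genes (clearly nonzero by design).
r_ab = pearson(gene_a, gene_b)

# Correlation between the genes after conditioning on the latent factor.
r_ab_given_latent = partial_corr(
    r_ab, pearson(gene_a, latent), pearson(gene_b, latent)
)

print(f"corr(a, b)          = {r_ab:.3f}")
print(f"corr(a, b | latent) = {r_ab_given_latent:.3f}")
```

In this toy setup the marginal correlation is sizeable while the partial correlation given the latent factor is close to zero, which is the sense in which a latent variable can explain much of the dependence within a group of genes.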

research
10/03/2011

Group Lasso with Overlaps: the Latent Group Lasso approach

We study a norm for structured sparsity which leads to sparse linear pre...
research
05/06/2019

Analysis of Gene Interaction Graphs for Biasing Machine Learning Models

Gene interaction graphs aim to capture various relationships between gen...
research
10/21/2019

Is graph biased feature selection of genes better than random?

Gene interaction graphs aim to capture various relationships between gen...
research
11/11/2022

Graph-Conditioned MLP for High-Dimensional Tabular Biomedical Data

Genome-wide studies leveraging recent high-throughput sequencing technol...
research
06/29/2021

Learning latent causal graphs via mixture oracles

We study the problem of reconstructing a causal graphical model from dat...
research
11/27/2019

Modelling dependence within and across run-off triangles for claims reserving

We propose a stochastic model for claims reserving that captures depende...