Is graph biased feature selection of genes better than random?

10/21/2019
by   Mohammad Hashir, et al.
0

Gene interaction graphs aim to capture various relationships between genes and can represent decades of biology research. When trying to make predictions from genomic data, those graphs could be used to overcome the curse of dimensionality by making machine learning models sparser and more biased with biological common knowledge. In this work, we focus on assessing whether those graphs capture dependencies seen in gene expression data better than random. We formulate a condition that graphs should satisfy to provide a good bias and propose to test it using a 'Single Gene Inference' (SGI) task. We compare random graphs with seven major gene interaction graphs published by different research groups, aiming to measure the true benefit of using biologically relevant graphs in this context. Our analysis finds that dependencies can be captured almost as well at random which suggests that, in terms of gene expression levels, the relevant information about the state of the cell is spread across many genes.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/06/2019

Analysis of Gene Interaction Graphs for Biasing Machine Learning Models

Gene interaction graphs aim to capture various relationships between gen...
research
06/18/2018

Towards Gene Expression Convolutions using Gene Interaction Graphs

We study the challenges of applying deep learning to gene expression dat...
research
01/29/2021

A principle feature analysis

A key task of data science is to identify relevant features linked to ce...
research
03/02/2023

Vine dependence graphs with latent variables as summaries for gene expression data

The advent of high-throughput sequencing technologies has lead to vast c...
research
06/05/2023

Graph Fourier MMD for Signals on Graphs

While numerous methods have been proposed for computing distances betwee...
research
01/08/2022

Automatically layout and visualize the biological pathway map with spectral graph theory

The pathway is a biological term that refers to a series of interactions...
research
03/03/2022

From local to global gene co-expression estimation using single-cell RNA-seq data

In genomics studies, the investigation of the gene relationship often br...

Please sign up or login with your details

Forgot password? Click here to reset