Using ontology embeddings for structural inductive bias in gene expression data analysis

11/22/2020
by   Maja Trȩbacz, et al.
14

Stratifying cancer patients based on their gene expression levels allows improving diagnosis, survival analysis and treatment planning. However, such data is extremely highly dimensional as it contains expression values for over 20000 genes per patient, and the number of samples in the datasets is low. To deal with such settings, we propose to incorporate prior biological knowledge about genes from ontologies into the machine learning system for the task of patient classification given their gene expression data. We use ontology embeddings that capture the semantic similarities between the genes to direct a Graph Convolutional Network, and therefore sparsify the network connections. We show this approach provides an advantage for predicting clinical targets from high-dimensional low-sample data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/19/2019

Identify Statistical Similarities and Differences Between the Deadliest Cancer Types Through Gene Expression

Prognostic genes have been well studied within each type of cancer. Howe...
research
11/11/2022

Graph-Conditioned MLP for High-Dimensional Tabular Biomedical Data

Genome-wide studies leveraging recent high-throughput sequencing technol...
research
06/18/2018

Towards Gene Expression Convolutions using Gene Interaction Graphs

We study the challenges of applying deep learning to gene expression dat...
research
11/09/2020

Stratification of Systemic Lupus Erythematosus Patients Using Gene Expression Data to Reveal Expression of Distinct Immune Pathways

Systemic lupus erythematosus (SLE) is the tenth leading cause of death i...
research
07/12/2017

Elephant Search with Deep Learning for Microarray Data Analysis

Even though there is a plethora of research in Microarray gene expressio...
research
03/25/2019

Gene Expression based Survival Prediction for Cancer Patients: A Topic Modeling Approach

Cancer is one of the leading cause of death, worldwide. Many believe tha...
research
07/12/2019

Predicting phenotypes from microarrays using amplified, initially marginal, eigenvector regression

Motivation: The discovery of relationships between gene expression measu...

Please sign up or login with your details

Forgot password? Click here to reset