An end-to-end framework for gene expression classification by integrating a background knowledge graph: application to cancer prognosis prediction

06/29/2023
by   Kazuma Inoue, et al.
0

Biological data may be separated into primary data, such as gene expression, and secondary data, such as pathways and protein-protein interactions. Methods using secondary data to enhance the analysis of primary data are promising, because secondary data have background information that is not included in primary data. In this study, we proposed an end-to-end framework to integrally handle secondary data to construct a classification model for primary data. We applied this framework to cancer prognosis prediction using gene expression data and a biological network. Cross-validation results indicated that our model achieved higher accuracy compared with a deep neural network model without background biological network information. Experiments conducted in patient groups by cancer type showed improvement in ROC-area under the curve for many groups. Visualizations of high accuracy cancer types identified contributing genes and pathways by enrichment analysis. Known biomarkers and novel biomarker candidates were identified through these experiments.

READ FULL TEXT
research
12/20/2018

A Method to Facilitate Cancer Detection and Type Classification from Gene Expression Data using a Deep Autoencoder and Neural Network

With the increased affordability and availability of whole-genome sequen...
research
03/28/2023

Genetic Analysis of Prostate Cancer with Computer Science Methods

Metastatic prostate cancer is one of the most common cancers in men. In ...
research
05/07/2020

Improving supervised prediction of aging-related genes via dynamic network analysis

This study focuses on supervised prediction of aging-related genes from ...
research
06/18/2019

Convolutional neural network models for cancer type prediction based on gene expression

Background Precise prediction of cancer types is vital for cancer diagno...
research
09/24/2019

Symplectic P-stable Additive Runge–Kutta Methods

Symplectic partitioned Runge–Kutta methods can be obtained from a variat...
research
05/08/2020

Multi-Phase Cross-modal Learning for Noninvasive Gene Mutation Prediction in Hepatocellular Carcinoma

Hepatocellular carcinoma (HCC) is the most common type of primary liver ...
research
04/29/2020

A non-parametric Hawkes process model of primary and secondary accidents on a UK smart motorway

A self-exciting spatio-temporal point process is fitted to incident data...

Please sign up or login with your details

Forgot password? Click here to reset