A graph-embedded deep feedforward network for disease outcome classification and feature selection using gene expression data

01/18/2018
by   Yunchuan Kong, et al.
0

Gene expression data represents a unique challenge in predictive model building, because of the small number of samples (n) compared to the huge amount of features (p). This "n<<p" property has hampered application of deep learning techniques for disease outcome classification. Sparse learning by incorporating external gene network information could be a potential solution to this issue. Still, the problem is very challenging because (1) there are tens of thousands of features and only hundreds of training samples, (2) the scale-free structure of the gene network is unfriendly to the setup of convolutional neural networks. To address these issues and build a robust classification model, we propose the Graph-Embedded Deep Feedforward Networks (GEDFN), to integrate external relational information of features into the deep neural network architecture. The method is able to achieve sparse connection between network layers to prevent overfitting. To validate the method's capability, we conducted both simulation experiments and a real data analysis using a breast cancer RNA-seq dataset from The Cancer Genome Atlas (TCGA). The resulting high classification accuracy and easily interpretable feature selection results suggest the method is a useful addition to the current classification models and feature selection procedures. The method is available at https://github.com/yunchuankong/NetworkNeuralNetwork.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/23/2019

forgeNet: A graph deep neural network model using tree-based ensemble classifiers for feature extraction

A unique challenge in predictive model building for omics data has been ...
research
02/15/2023

Interpretable Deep Learning Methods for Multiview Learning

Technological advances have enabled the generation of unique and complem...
research
01/26/2019

Sparse evolutionary Deep Learning with over one million artificial neurons on commodity hardware

Microarray gene expression has widely attracted the eyes of the public a...
research
05/06/2012

TIGRESS: Trustful Inference of Gene REgulation using Stability Selection

Inferring the structure of gene regulatory networks (GRN) from gene expr...
research
08/02/2023

Evaluation of network-guided random forest for disease gene discovery

Gene network information is believed to be beneficial for disease module...
research
03/09/2018

Breast Tumor Classification Based on Decision Information Genes and Inverse Projection Sparse Representation

Microarray gene expression data-based breast tumor classification is an ...
research
12/11/2018

Classification of Cervical Cancer Dataset

Cervical cancer is the leading gynecological malignancy worldwide. This ...

Please sign up or login with your details

Forgot password? Click here to reset