forgeNet: A graph deep neural network model using tree-based ensemble classifiers for feature extraction

05/23/2019
by   Yunchuan Kong, et al.
0

A unique challenge in predictive model building for omics data has been the small number of samples (n) versus the large amount of features (p). This "n≪ p" property brings difficulties for disease outcome classification using deep learning techniques. Sparse learning by incorporating external gene network information such as the graph-embedded deep feedforward network (GEDFN) model has been a solution to this issue. However, such methods require an existing feature graph, and potential mis-specification of the feature graph can be harmful on classification and feature selection. To address this limitation and develop a robust classification model without relying on external knowledge, we propose a forest graph-embedded deep feedforward network (forgeNet) model, to integrate the GEDFN architecture with a forest feature graph extractor, so that the feature graph can be learned in a supervised manner and specifically constructed for a given prediction task. To validate the method's capability, we experimented the forgeNet model with both synthetic and real datasets. The resulting high classification accuracy suggests that the method is a valuable addition to sparse deep learning models for omics data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/18/2018

A graph-embedded deep feedforward network for disease outcome classification and feature selection using gene expression data

Gene expression data represents a unique challenge in predictive model b...
research
01/30/2022

Sparse Centroid-Encoder: A Nonlinear Model for Feature Selection

We develop a sparse optimization problem for the determination of the to...
research
09/18/2020

Optimizing Speech Emotion Recognition using Manta-Ray Based Feature Selection

Emotion recognition from audio signals has been regarded as a challengin...
research
01/23/2020

FsNet: Feature Selection Network on High-dimensional Biological Data

Biological data are generally high-dimensional and require efficient mac...
research
06/20/2020

Deep Double-Side Learning Ensemble Model for Few-Shot Parkinson Speech Recognition

Diagnosis and therapeutic effect assessment of Parkinson disease based o...
research
03/09/2018

Breast Tumor Classification Based on Decision Information Genes and Inverse Projection Sparse Representation

Microarray gene expression data-based breast tumor classification is an ...
research
01/26/2019

Sparse evolutionary Deep Learning with over one million artificial neurons on commodity hardware

Microarray gene expression has widely attracted the eyes of the public a...

Please sign up or login with your details

Forgot password? Click here to reset