DeepPINK: reproducible feature selection in deep neural networks

09/04/2018
by   Yang Young Lu, et al.
0

Deep learning has become increasingly popular in both supervised and unsupervised machine learning thanks to its outstanding empirical performance. However, because of their intrinsic complexity, most deep learning methods are largely treated as black box tools with little interpretability. Even though recent attempts have been made to facilitate the interpretability of deep neural networks (DNNs), existing methods are susceptible to noise and lack of robustness. Therefore, scientists are justifiably cautious about the reproducibility of the discoveries, which is often related to the interpretability of the underlying statistical models. In this paper, we describe a method to increase the interpretability and reproducibility of DNNs by incorporating the idea of feature selection with controlled error rate. By designing a new DNN architecture and integrating it with the recently proposed knockoffs framework, we perform feature selection with a controlled error rate, while maintaining high power. This new method, DeepPINK (Deep feature selection using Paired-Input Nonlinear Knockoffs), is applied to both simulated and real data sets to demonstrate its empirical utility.

READ FULL TEXT
research
10/13/2020

Neural Gaussian Mirror for Controlled Feature Selection in Neural Networks

Deep neural networks (DNNs) have become increasingly popular and achieve...
research
05/24/2019

Deep-gKnock: nonlinear group-feature selection with deep neural network

Feature selection is central to contemporary high-dimensional data analy...
research
09/15/2022

Feature Selection integrated Deep Learning for Ultrahigh Dimensional and Highly Correlated Feature Space

In recent years, deep learning has been a topic of interest in almost al...
research
12/22/2017

Dropout Feature Ranking for Deep Learning Models

Deep neural networks are a promising technology achieving state-of-the-a...
research
10/08/2021

Graphs as Tools to Improve Deep Learning Methods

In recent years, deep neural networks (DNNs) have known an important ris...
research
09/28/2022

Variance Tolerance Factors For Interpreting Neural Networks

Black box models only provide results for deep learning tasks and lack i...
research
09/29/2021

Deep neural networks with controlled variable selection for the identification of putative causal genetic variants

Deep neural networks (DNN) have been used successfully in many scientifi...

Please sign up or login with your details

Forgot password? Click here to reset