Switched linear projections and inactive state sensitivity for deep neural network interpretability

09/25/2019
by Lech Szymanski, et al.

We introduce switched linear projections for expressing the activity of a neuron in a ReLU-based deep neural network in terms of a single linear projection in the input space. The method works by isolating the active subnetwork, a series of linear transformations that completely determines the computation of the deep network for a given input instance. We also propose that, for interpretability, it is more instructive and meaningful to focus on the patterns that deactivate the neurons in the network. These patterns are ignored by existing methods, which implicitly track only the active aspect of the network's computation. We introduce Insens, a novel interpretability method based on inactive state sensitivity. Comparison against existing methods shows that Insens is more robust (in the presence of noise), more complete (in terms of patterns that affect the computation), and a very effective interpretability method for deep neural networks.
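The idea of collapsing a ReLU network into a single projection for one input can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `forward` and `switched_projection`, the layer sizes, and the random weights are all assumptions made for illustration, and the sketch covers only a plain fully connected network with a linear output layer. It does not attempt to reproduce Insens.

```python
import numpy as np

def forward(x, weights, biases):
    """Plain forward pass of a ReLU MLP; the last layer is left linear."""
    h = x
    for i, (W, b) in enumerate(zip(weights, biases)):
        h = W @ h + b
        if i < len(weights) - 1:
            h = np.maximum(h, 0.0)
    return h

def switched_projection(x, weights, biases):
    """Collapse the network, for this particular x, into one affine map.

    Returns (W_eff, b_eff) such that W_eff @ x + b_eff equals forward(x).
    Hypothetical helper written for illustration only.
    """
    W_eff = np.eye(x.shape[0])
    b_eff = np.zeros(x.shape[0])
    for i, (W, b) in enumerate(zip(weights, biases)):
        # Compose this layer's affine map with everything before it.
        W_eff = W @ W_eff
        b_eff = W @ b_eff + b
        if i < len(weights) - 1:
            # "Switch": neurons whose pre-activation is not positive for x
            # contribute nothing, so their rows are zeroed out.
            active = (W_eff @ x + b_eff > 0).astype(float)
            W_eff = active[:, None] * W_eff
            b_eff = active * b_eff
    return W_eff, b_eff

# Assumed toy network: 4 inputs, two hidden layers of 8 units, 3 outputs.
rng = np.random.default_rng(0)
sizes = [4, 8, 8, 3]
weights = [rng.standard_normal((m, n)) for n, m in zip(sizes[:-1], sizes[1:])]
biases = [rng.standard_normal(m) for m in sizes[1:]]

x = rng.standard_normal(4)
W_eff, b_eff = switched_projection(x, weights, biases)

# For this x, the single affine map reproduces the full network output.
assert np.allclose(W_eff @ x + b_eff, forward(x, weights, biases))
```

Each row of `W_eff` is a direction in input space, so the output of the corresponding neuron for this input is one dot product plus an offset. The projection is valid only for inputs that share the same pattern of active and inactive neurons; a different input generally yields a different `W_eff`.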

research
02/07/2021

SeReNe: Sensitivity based Regularization of Neurons for Structured Sparsity in Neural Networks

Deep neural networks include millions of learnable parameters, making th...
research
10/11/2021

NFT-K: Non-Fungible Tangent Kernels

Deep neural networks have become essential for numerous applications due...
research
08/30/2021

Neuron-level Interpretation of Deep NLP Models: A Survey

The proliferation of deep neural networks in various domains has seen an...
research
10/22/2020

Towards falsifiable interpretability research

Methods for understanding the decisions of and mechanisms underlying dee...
research
02/17/2020

Investigating the Compositional Structure Of Deep Neural Networks

The current understanding of deep neural networks can only partially exp...
research
10/11/2018

Learning Optimal Deep Projection of ^18F-FDG PET Imaging for Early Differential Diagnosis of Parkinsonian Syndromes

Several diseases of parkinsonian syndromes present similar symptoms at e...
research
12/28/2020

A Survey on Neural Network Interpretability

Along with the great success of deep neural networks, there is also grow...
