Inverting the Feature Visualization Process for Feedforward Neural Networks

07/21/2020
by   Christian Reinbold, et al.
0

This work sheds light on the invertibility of feature visualization in neural networks. Since the input that is generated by feature visualization using activation maximization does, in general, not yield the feature objective it was optimized for, we investigate optimizing for the feature objective that yields this input. Given the objective function used in activation maximization that measures how closely a given input resembles the feature objective, we exploit that the gradient of this function w.r.t. inputs is—up to a scaling factor—linear in the objective. This observation is used to find the optimal feature objective via computing a closed form solution that minimizes the gradient. By means of Inverse Feature Visualization, we intend to provide an alternative view on a networks sensitivity to certain inputs that considers feature objectives rather than activations.

READ FULL TEXT

page 6

page 7

page 8

page 14

page 15

page 19

page 20

research
02/11/2016

Multifaceted Feature Visualization: Uncovering the Different Types of Features Learned By Each Neuron in Deep Neural Networks

We can better understand deep neural networks by identifying which featu...
research
06/12/2023

Adversarial Attacks on the Interpretation of Neuron Activation Maximization

The internal functional behavior of trained Deep Neural Networks is noto...
research
06/23/2021

How Well do Feature Visualizations Support Causal Understanding of CNN Activations?

One widely used approach towards understanding the inner workings of dee...
research
05/08/2019

Unsupervised Learning through Temporal Smoothing and Entropy Maximization

This paper proposes a method for machine learning from unlabeled data in...
research
07/06/2019

Towards Debugging Deep Neural Networks by Generating Speech Utterances

Deep neural networks (DNN) are able to successfully process and classify...
research
06/07/2023

Don't trust your eyes: on the (un)reliability of feature visualizations

How do neural networks extract patterns from pixels? Feature visualizati...
research
04/18/2019

Understanding Neural Networks via Feature Visualization: A survey

A neuroscience method to understanding the brain is to find and study th...

Please sign up or login with your details

Forgot password? Click here to reset