HINT: Hierarchical Neuron Concept Explainer

03/27/2022
by   Andong Wang, et al.

To interpret deep networks, one main approach is to associate neurons with human-understandable concepts. However, existing methods often ignore the inherent relationships between different concepts (e.g., dog and cat both belong to animals), and thus lose the chance to explain neurons responsible for higher-level concepts (e.g., animal). In this paper, we study hierarchical concepts, inspired by the hierarchical cognition process of human beings. To this end, we propose HIerarchical Neuron concepT explainer (HINT) to effectively build bidirectional associations between neurons and hierarchical concepts in a low-cost and scalable manner. HINT enables us to systematically and quantitatively study whether and how the implicit hierarchical relationships of concepts are embedded into neurons, such as identifying collaborative neurons responsible for one concept and multimodal neurons responsible for different concepts, at different semantic levels from concrete concepts (e.g., dog) to more abstract ones (e.g., animal). Finally, we verify the faithfulness of the associations using Weakly Supervised Object Localization, and demonstrate HINT's applicability in various tasks such as discovering saliency regions and explaining adversarial attacks. Code is available at https://github.com/AntonotnaWang/HINT.


