Interpreting Deep Classifier by Visual Distillation of Dark Knowledge

03/11/2018
by   Kai Xu, et al.
0

Interpreting black box classifiers, such as deep networks, allows an analyst to validate a classifier before it is deployed in a high-stakes setting. A natural idea is to visualize the deep network's representations, so as to "see what the network sees". In this paper, we demonstrate that standard dimension reduction methods in this setting can yield uninformative or even misleading visualizations. Instead, we present DarkSight, which visually summarizes the predictions of a classifier in a way inspired by notion of dark knowledge. DarkSight embeds the data points into a low-dimensional space such that it is easy to compress the deep classifier into a simpler one, essentially combining model compression and dimension reduction. We compare DarkSight against t-SNE both qualitatively and quantitatively, demonstrating that DarkSight visualizations are more informative. Our method additionally yields a new confidence measure based on dark knowledge by quantifying how unusual a given vector of predictions is.

READ FULL TEXT

page 8

page 14

research
10/25/2022

A Spectral Method for Assessing and Combining Multiple Data Visualizations

Dimension reduction and data visualization aim to project a high-dimensi...
research
09/14/2021

ε-isometric dimension reduction for incompressible subsets of ℓ_p

Fix p∈[1,∞), K∈(0,∞) and a probability measure μ. We prove that for ever...
research
11/30/2015

Universality laws for randomized dimension reduction, with applications

Dimension reduction is the process of embedding high-dimensional data in...
research
03/19/2015

Reduced Basis Decomposition: a Certified and Fast Lossy Data Compression Algorithm

Dimension reduction is often needed in the area of data mining. The goal...
research
04/22/2019

Local Deep-Feature Alignment for Unsupervised Dimension Reduction

This paper presents an unsupervised deep-learning framework named Local ...
research
05/05/2020

Interpreting Deep Models through the Lens of Data

Identification of input data points relevant for the classifier (i.e. se...

Please sign up or login with your details

Forgot password? Click here to reset