Label-Free Explainability for Unsupervised Models

03/03/2022
by Jonathan Crabbé, et al.

Unsupervised black-box models are challenging to interpret because most existing explainability methods require labels to select which component(s) of the black-box's output to interpret. In the absence of labels, black-box outputs are often representation vectors whose components do not correspond to any meaningful quantity. Hence, choosing which component(s) to interpret in a label-free unsupervised/self-supervised setting is an important yet unsolved problem. To bridge this gap in the literature, we introduce two extensions of post-hoc explanation techniques: (1) label-free feature importance and (2) label-free example importance, which respectively highlight the features and the training examples that most influence how a black-box constructs representations at inference time. We demonstrate that our extensions can be implemented as simple wrappers around many existing feature and example importance methods. We illustrate the utility of our label-free explainability paradigm through a qualitative and quantitative comparison of representation spaces learned by various autoencoders trained on distinct unsupervised tasks.
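To make the "simple wrapper" idea concrete, here is a minimal sketch of label-free feature importance. It rests on assumptions not stated in the abstract: the wrapper reduces the representation vector to a scalar by taking its inner product with the representation of the input being explained, and the wrapped method is a plain finite-difference gradient saliency. The names `encoder`, `label_free_wrapper` and `saliency`, and the toy weight matrix `W`, are hypothetical and chosen only for illustration.

```python
import numpy as np

def encoder(x, W):
    """Hypothetical black-box encoder: maps an input to a representation vector."""
    return np.tanh(W @ x)

def label_free_wrapper(f, x):
    """Turn a vector-valued encoder f into a scalar function of the input.

    Assumption: the scalar is the inner product between the fixed
    representation f(x) and the representation of a perturbed input.
    """
    anchor = f(x)  # held constant, so no label is needed to pick a component
    return lambda x_tilde: float(anchor @ f(x_tilde))

def saliency(g, x, eps=1e-5):
    """Central finite-difference gradient of a scalar function g at x."""
    grad = np.zeros_like(x)
    for i in range(x.size):
        step = np.zeros_like(x)
        step[i] = eps
        grad[i] = (g(x + step) - g(x - step)) / (2 * eps)
    return grad

rng = np.random.default_rng(0)
W = rng.normal(size=(3, 5))
W[:, 4] = 0.0  # by construction, the toy encoder ignores the last feature
x = rng.normal(size=5)

g = label_free_wrapper(lambda z: encoder(z, W), x)
scores = saliency(g, x)  # one importance score per input feature
```

Because the scalar function depends only on the encoder and the input itself, any feature importance method that expects a scalar output could in principle be substituted for `saliency`. In this toy setup the feature the encoder ignores receives a score of zero.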


