ProtoVAE: A Trustworthy Self-Explainable Prototypical Variational Model

10/15/2022
by   Srishti Gautam, et al.
0

The need for interpretable models has fostered the development of self-explainable classifiers. Prior approaches are either based on multi-stage optimization schemes, impacting the predictive performance of the model, or produce explanations that are not transparent, trustworthy or do not capture the diversity of the data. To address these shortcomings, we propose ProtoVAE, a variational autoencoder-based framework that learns class-specific prototypes in an end-to-end manner and enforces trustworthiness and diversity by regularizing the representation space and introducing an orthonormality constraint. Finally, the model is designed to be transparent by directly incorporating the prototypes into the decision process. Extensive comparisons with previous self-explainable approaches demonstrate the superiority of ProtoVAE, highlighting its ability to generate trustworthy and diverse explanations, while not degrading predictive performance.

READ FULL TEXT

page 17

page 18

page 19

page 22

page 23

page 24

page 25

page 26

research
11/17/2020

On the Relationship Between KR Approaches for Explainable Planning

In this paper, we build upon notions from knowledge representation and r...
research
09/26/2022

Greybox XAI: a Neural-Symbolic learning framework to produce interpretable predictions for image classification

Although Deep Neural Networks (DNNs) have great generalization and predi...
research
04/29/2021

A First Look: Towards Explainable TextVQA Models via Visual and Textual Explanations

Explainable deep learning models are advantageous in many situations. Pr...
research
02/24/2021

Teach Me to Explain: A Review of Datasets for Explainable NLP

Explainable NLP (ExNLP) has increasingly focused on collecting human-ann...
research
10/18/2021

On Predictive Explanation of Data Anomalies

Numerous algorithms have been proposed for detecting anomalies (outliers...
research
07/28/2023

Toward Transparent Sequence Models with Model-Based Tree Markov Model

In this study, we address the interpretability issue in complex, black-b...
research
02/06/2023

Learning disentangled representations for explainable chest X-ray classification using Dirichlet VAEs

This study explores the use of the Dirichlet Variational Autoencoder (Di...

Please sign up or login with your details

Forgot password? Click here to reset