Let the CAT out of the bag: Contrastive Attributed explanations for Text

09/16/2021
by   Saneem Chemmengath, et al.
5

Contrastive explanations for understanding the behavior of black box models has gained a lot of attention recently as they provide potential for recourse. In this paper, we propose a method Contrastive Attributed explanations for Text (CAT) which provides contrastive explanations for natural language text data with a novel twist as we build and exploit attribute classifiers leading to more semantically meaningful explanations. To ensure that our contrastive generated text has the fewest possible edits with respect to the original text, while also being fluent and close to a human generated contrastive, we resort to a minimal perturbation approach regularized using a BERT language model and attribute classifiers trained on available attributes. We show through qualitative examples and a user study that our method not only conveys more insight because of these attributes, but also leads to better quality (contrastive) text. Moreover, quantitatively we show that our method is more efficient than other state-of-the-art methods with it also scoring higher on benchmark metrics such as flip rate, (normalized) Levenstein distance, fluency and content preservation.

READ FULL TEXT
research
05/31/2019

Model Agnostic Contrastive Explanations for Structured Data

Recently, a method [7] was proposed to generate contrastive explanations...
research
05/29/2019

Generating Contrastive Explanations with Monotonic Attribute Functions

Explaining decisions of deep neural networks is a hot research topic wit...
research
11/19/2018

Towards Global Explanations for Credit Risk Scoring

In this paper we propose a method to obtain global explanations for trai...
research
06/21/2019

Generating Counterfactual and Contrastive Explanations using SHAP

With the advent of GDPR, the domain of explainable AI and model interpre...
research
08/11/2023

Contrastive Explanations of Multi-agent Optimization Solutions

In many real-world scenarios, agents are involved in optimization proble...
research
08/21/2023

SupEuclid: Extremely Simple, High Quality OoD Detection with Supervised Contrastive Learning and Euclidean Distance

Out-of-Distribution (OoD) detection has developed substantially in the p...
research
09/22/2020

ALICE: Active Learning with Contrastive Natural Language Explanations

Training a supervised neural network classifier typically requires many ...

Please sign up or login with your details

Forgot password? Click here to reset