A Game Theoretic Approach to Class-wise Selective Rationalization

10/28/2019
by   Shiyu Chang, et al.
13

Selection of input features such as relevant pieces of text has become a common technique of highlighting how complex neural predictors operate. The selection can be optimized post-hoc for trained models or incorporated directly into the method itself (self-explaining). However, an overall selection does not properly capture the multi-faceted nature of useful rationales such as pros and cons for decisions. To this end, we propose a new game theoretic approach to class-dependent rationalization, where the method is specifically trained to highlight evidence supporting alternative conclusions. Each class involves three players set up competitively to find evidence for factual and counterfactual scenarios. We show theoretically in a simplified scenario how the game drives the solution towards meaningful class-dependent rationales. We evaluate the method in single- and multi-aspect sentiment classification tasks and demonstrate that the proposed method is able to identify both factual (justifying the ground truth label) and counterfactual (countering the ground truth label) rationales consistent with human rationalization. The code for our method is publicly available.

READ FULL TEXT
research
05/11/2021

Rationalization through Concepts

Automated predictions require explanations to be interpretable by humans...
research
05/26/2023

Counterfactuals of Counterfactuals: a back-translation-inspired approach to analyse counterfactual editors

In the wake of responsible AI, interpretability methods, which attempt t...
research
03/07/2023

GaussianMLR: Learning Implicit Class Significance via Calibrated Multi-Label Ranking

Existing multi-label frameworks only exploit the information deduced fro...
research
02/11/2023

A novel approach to generate datasets with XAI ground truth to evaluate image models

With the increased usage of artificial intelligence (AI), it is imperati...
research
07/10/2020

Robust Classification under Class-Dependent Domain Shift

Investigation of machine learning algorithms robust to changes between t...
research
06/24/2021

Meaningfully Explaining a Model's Mistakes

Understanding and explaining the mistakes made by trained models is crit...
research
06/06/2023

Designing Decision Support Systems Using Counterfactual Prediction Sets

Decision support systems for classification tasks are predominantly desi...

Please sign up or login with your details

Forgot password? Click here to reset