Understanding Text Classification Data and Models Using Aggregated Input Salience

11/10/2022
by Sebastian Ebert, et al.

Realizing when a model is right for the wrong reason is not trivial and requires significant effort from model developers. In some cases, an input salience method, which highlights the most important parts of the input, may reveal problematic reasoning. But scrutinizing highlights over many data instances is tedious and often infeasible. Furthermore, analyzing examples in isolation does not reveal general patterns in the data or in the model's behavior. In this paper, we aim to address these issues and go from understanding single examples to understanding entire datasets and models. The methodology we propose is based on aggregated salience maps. Using this methodology, we address multiple distinct but common model developer needs by showing how problematic data and model behavior can be identified – a necessary first step for improving the model.
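
To make the idea of aggregating salience more concrete, the following is a minimal sketch and not the paper's actual procedure, which the abstract does not specify. It assumes that some salience method has already produced one score per token for each example; the function name `aggregate_salience`, the input layout, and the choice of averaging scores per token and class are all illustrative assumptions.

```python
from collections import defaultdict


def aggregate_salience(examples, top_k=3):
    """Average per-token salience within each class and rank tokens.

    `examples` is an iterable of (label, [(token, score), ...]) pairs.
    This layout is a hypothetical one chosen for illustration only.
    """
    totals = defaultdict(lambda: defaultdict(float))
    counts = defaultdict(lambda: defaultdict(int))
    for label, token_scores in examples:
        for token, score in token_scores:
            key = token.lower()
            totals[label][key] += score
            counts[label][key] += 1

    ranked = {}
    for label, token_totals in totals.items():
        means = {t: token_totals[t] / counts[label][t] for t in token_totals}
        # Sort by descending mean salience, breaking ties alphabetically.
        ranked[label] = sorted(means.items(), key=lambda kv: (-kv[1], kv[0]))[:top_k]
    return ranked


# Toy salience scores, invented purely for this example.
toy = [
    ("positive", [("great", 0.8), ("movie", 0.1), ("the", 0.1)]),
    ("positive", [("great", 0.6), ("acting", 0.3), ("the", 0.1)]),
    ("negative", [("boring", 0.7), ("movie", 0.2), ("the", 0.1)]),
]

for label, tokens in aggregate_salience(toy, top_k=2).items():
    print(label, tokens)
```

Under these assumptions, a token that ranks highly for a class without being semantically related to it (for example, a punctuation mark or a dataset-specific name) would be a candidate for closer inspection as a possible shortcut.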

