A Multilingual Perspective Towards the Evaluation of Attribution Methods in Natural Language Inference

04/11/2022
by   Kerem Zaman, et al.
0

Most evaluations of attribution methods focus on the English language. In this work, we present a multilingual approach for evaluating attribution methods for the Natural Language Inference (NLI) task in terms of plausibility and faithfulness properties. First, we introduce a novel cross-lingual strategy to measure faithfulness based on word alignments, which eliminates the potential downsides of erasure-based evaluations. We then perform a comprehensive evaluation of attribution methods, considering different output mechanisms and aggregation methods. Finally, we augment the XNLI dataset with highlight-based explanations, providing a multilingual NLI dataset with highlights, which may support future exNLP studies. Our results show that attribution methods performing best for plausibility and faithfulness are different.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/13/2016

A Supervised Authorship Attribution Framework for Bengali Language

Authorship Attribution is a long-standing problem in Natural Language Pr...
research
05/10/2023

Automatic Evaluation of Attribution by Large Language Models

A recent focus of large language model (LLM) development, as exemplified...
research
06/10/2021

DT-grams: Structured Dependency Grammar Stylometry for Cross-Language Authorship Attribution

Cross-language authorship attribution problems rely on either translatio...
research
07/20/2020

Shopping in the Multiverse: A Counterfactual Approach to In-Session Attribution

We tackle the challenge of in-session attribution for on-site search eng...
research
12/23/2021

Measuring Attribution in Natural Language Generation Models

With recent improvements in natural language generation (NLG) models for...
research
05/18/2021

Darknet Data Mining – A Canadian Cyber-crime Perspective

Exploring the darknet can be a daunting task; this paper explores the ap...
research
08/27/2021

Translation Error Detection as Rationale Extraction

Recent Quality Estimation (QE) models based on multilingual pre-trained ...

Please sign up or login with your details

Forgot password? Click here to reset