Towards explainable evaluation of language models on the semantic similarity of visual concepts

09/08/2022
by   Maria Lymperaiou, et al.
0

Recent breakthroughs in NLP research, such as the advent of Transformer models have indisputably contributed to major advancements in several tasks. However, few works research robustness and explainability issues of their evaluation strategies. In this work, we examine the behavior of high-performing pre-trained language models, focusing on the task of semantic similarity for visual vocabularies. First, we address the need for explainable evaluation metrics, necessary for understanding the conceptual quality of retrieved instances. Our proposed metrics provide valuable insights in local and global level, showcasing the inabilities of widely used approaches. Secondly, adversarial interventions on salient query semantics expose vulnerabilities of opaque metrics and highlight patterns in learned linguistic representations.

READ FULL TEXT

page 6

page 11

page 12

research
10/08/2021

Global Explainability of BERT-Based Evaluation Metrics by Disentangling along Linguistic Factors

Evaluation metrics are a key ingredient for progress of text generation ...
research
08/09/2022

Compositional Evaluation on Japanese Textual Entailment and Similarity

Natural Language Inference (NLI) and Semantic Textual Similarity (STS) a...
research
09/11/2022

Testing Pre-trained Language Models' Understanding of Distributivity via Causal Mediation Analysis

To what extent do pre-trained language models grasp semantic knowledge r...
research
08/15/2022

MENLI: Robust Evaluation Metrics from Natural Language Inference

Recently proposed BERT-based evaluation metrics perform well on standard...
research
05/08/2023

Knowledge Graph Guided Semantic Evaluation of Language Models For User Trust

A fundamental question in natural language processing is - what kind of ...
research
12/19/2022

(Psycho-)Linguistic Features Meet Transformer Models for Improved Explainable and Controllable Text Simplification

State-of-the-art text simplification (TS) systems adopt end-to-end neura...
research
09/10/2020

Patient Cohort Retrieval using Transformer Language Models

We apply deep learning-based language models to the task of patient coho...

Please sign up or login with your details

Forgot password? Click here to reset