A Study of Automatic Metrics for the Evaluation of Natural Language Explanations

03/15/2021
by   Miruna Clinciu, et al.
0

As transparency becomes key for robotics and AI, it will be necessary to evaluate the methods through which transparency is provided, including automatically generated natural language (NL) explanations. Here, we explore parallels between the generation of such explanations and the much-studied field of evaluation of Natural Language Generation (NLG). Specifically, we investigate which of the NLG evaluation measures map well to explanations. We present the ExBAN corpus: a crowd-sourced corpus of NL explanations for Bayesian Networks. We run correlations comparing human subjective ratings with NLG automatic measures. We find that embedding-based automatic NLG evaluation methods, such as BERTScore and BLEURT, have a higher correlation with human ratings, compared to word-overlap metrics, such as BLEU and ROUGE. This work has implications for Explainable AI and transparent robotic and autonomous systems.

READ FULL TEXT
research
08/18/2021

I don't understand! Evaluation Methods for Natural Language Explanations

Explainability of intelligent systems is key for future adoption. While ...
research
05/24/2023

Using Natural Language Explanations to Rescale Human Judgments

The rise of large language models (LLMs) has brought a critical need for...
research
09/16/2019

Communication-based Evaluation for Natural Language Generation

Natural language generation (NLG) systems are commonly evaluated using n...
research
04/12/2021

Estimating Subjective Crowd-Evaluations as an Additional Objective to Improve Natural Language Generation

Human ratings are one of the most prevalent methods to evaluate the perf...
research
05/31/2023

A Surrogate Model Framework for Explainable Autonomous Behaviour

Adoption and deployment of robotic and autonomous systems in industry ar...
research
02/27/2023

Evaluation of Automatically Constructed Word Meaning Explanations

Preparing exact and comprehensive word meaning explanations is one of th...
research
08/27/2023

Situated Natural Language Explanations

Natural language is among the most accessible tools for explaining decis...

Please sign up or login with your details

Forgot password? Click here to reset