Consistency and Coherence from Points of Contextual Similarity

12/22/2021
by Oleg Vasilyev, et al.

Factual consistency is one of the important dimensions of summary evaluation, especially as generated summaries become more fluent and coherent. The ESTIME measure, recently proposed specifically for factual consistency, achieves high correlations with human expert scores for both consistency and fluency, but it is in principle restricted to text-summary pairs with high dictionary overlap. This is not a problem for current styles of summarization, but it may become an obstacle for future summarization systems, or for evaluating arbitrary claims against the text. In this work we generalize the method and make a variant of the measure applicable to any text-summary pair. Because ESTIME uses points of contextual similarity, it provides insight into the usefulness of information taken from different BERT layers. We observe that useful information exists in almost all of the layers except the several lowest ones. For consistency and fluency, qualities that depend on local text details, the most useful layers are close to the top (but not at the top); for coherence and relevance we found a more complicated and interesting picture.
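To make the idea of "points of contextual similarity" concrete, the sketch below pairs each summary token with the text token whose embedding is most similar, and counts the pairs in which the two tokens differ. This is an illustration under stated assumptions, not the authors' implementation: the function names `cosine` and `count_mismatches` are hypothetical, and the two-dimensional vectors are made-up stand-ins for contextual embeddings that would in practice come from a chosen BERT layer.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def count_mismatches(summary, text):
    """Count summary tokens whose nearest text embedding belongs to a different token.

    Both arguments are lists of (token, embedding) pairs. The embeddings here
    are toy values (an assumption); a real setup would take them from a model.
    """
    mismatches = 0
    for s_tok, s_vec in summary:
        # Point of contextual similarity: the most similar text embedding.
        best_tok, _ = max(text, key=lambda pair: cosine(s_vec, pair[1]))
        if best_tok != s_tok:
            mismatches += 1
    return mismatches


# Toy data: hand-written vectors, not real model output.
text = [("cat", [1.0, 0.0]), ("sat", [0.0, 1.0]), ("mat", [0.7, 0.7])]
summary_ok = [("cat", [0.9, 0.1]), ("sat", [0.1, 0.9])]
summary_bad = [("dog", [0.9, 0.1]), ("sat", [0.1, 0.9])]

print(count_mismatches(summary_ok, text))
print(count_mismatches(summary_bad, text))
```

A count of this kind depends on summary tokens also appearing in the text, which is the dictionary-overlap restriction the abstract describes.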


research
01/23/2022

WIDAR – Weighted Input Document Augmented ROUGE

The task of automatic text summarization has gained a lot of traction du...
research
06/06/2017

Text Summarization using Abstract Meaning Representation

With an ever increasing size of text present on the Internet, automatic ...
research
04/12/2021

Estimation of Summary-to-Text Inconsistency by Mismatched Embeddings

We propose a new reference-free summary quality evaluation measure, with...
research
07/11/2022

SummScore: A Comprehensive Evaluation Metric for Summary Quality Based on Cross-Encoder

Text summarization models are often trained to produce summaries that me...
research
09/19/2021

Investigating Crowdsourcing Protocols for Evaluating the Factual Consistency of Summaries

Current pre-trained models applied to summarization are prone to factual...
research
10/18/2022

Summary Workbench: Unifying Application and Evaluation of Text Summarization Models

This paper presents Summary Workbench, a new tool for developing and eva...
