UMSE: Unified Multi-scenario Summarization Evaluation

05/26/2023
by   Shen Gao, et al.
0

Summarization quality evaluation is a non-trivial task in text summarization. Contemporary methods can be mainly categorized into two scenarios: (1) reference-based: evaluating with human-labeled reference summary; (2) reference-free: evaluating the summary consistency of the document. Recent studies mainly focus on one of these scenarios and explore training neural models built on PLMs to align with human criteria. However, the models from different scenarios are optimized individually, which may result in sub-optimal performance since they neglect the shared knowledge across different scenarios. Besides, designing individual models for each scenario caused inconvenience to the user. Inspired by this, we propose Unified Multi-scenario Summarization Evaluation Model (UMSE). More specifically, we propose a perturbed prefix tuning method to share cross-scenario knowledge between scenarios and use a self-supervised training paradigm to optimize the model without extra human labeling. Our UMSE is the first unified summarization evaluation framework engaged with the ability to be used in three evaluation scenarios. Experimental results across three typical scenarios on the benchmark dataset SummEval indicate that our UMSE can achieve comparable performance with several existing strong methods which are specifically designed for each scenario.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/23/2023

On Learning to Summarize with Large Language Models as References

Recent studies have found that summaries generated by large language mod...
research
01/23/2022

WIDAR – Weighted Input Document Augmented ROUGE

The task of automatic text summarization has gained a lot of traction du...
research
06/26/2021

A Training-free and Reference-free Summarization Evaluation Metric via Centrality-weighted Relevance and Self-referenced Redundancy

In recent years, reference-based and supervised summarization evaluation...
research
03/27/2023

Large Language Models are Diverse Role-Players for Summarization Evaluation

Text summarization has a wide range of applications in many scenarios. T...
research
08/04/2023

Redundancy Aware Multi-Reference Based Gainwise Evaluation of Extractive Summarization

While very popular for evaluating extractive summarization task, the ROU...
research
05/23/2023

USB: A Unified Summarization Benchmark Across Tasks and Domains

An abundance of datasets exist for training and evaluating models on the...
research
04/04/2020

End-to-End Abstractive Summarization for Meetings

With the abundance of automatic meeting transcripts, meeting summarizati...

Please sign up or login with your details

Forgot password? Click here to reset