Balancing Lexical and Semantic Quality in Abstractive Summarization

05/17/2023
by   Jeewoo Sul, et al.
0

An important problem of the sequence-to-sequence neural models widely used in abstractive summarization is exposure bias. To alleviate this problem, re-ranking systems have been applied in recent years. Despite some performance improvements, this approach remains underexplored. Previous works have mostly specified the rank through the ROUGE score and aligned candidate summaries, but there can be quite a large gap between the lexical overlap metric and semantic similarity. In this paper, we propose a novel training method in which a re-ranker balances the lexical and semantic quality. We further newly define false positives in ranking and present a strategy to reduce their influence. Experiments on the CNN/DailyMail and XSum datasets show that our method can estimate the meaning of summaries without seriously degrading the lexical aspect. More specifically, it achieves an 89.67 BERTScore on the CNN/DailyMail dataset, reaching new state-of-the-art performance. Our code is publicly available at https://github.com/jeewoo1025/BalSum.

READ FULL TEXT
research
10/17/2022

Towards Summary Candidates Fusion

Sequence-to-sequence deep neural models fine-tuned for abstractive summa...
research
03/13/2022

SummaReranker: A Multi-Task Mixture-of-Experts Re-ranking Framework for Abstractive Summarization

Sequence-to-sequence neural networks have recently achieved great succes...
research
02/18/2020

Learning by Semantic Similarity Makes Abstractive Summarization Better

One of the obstacles of abstractive summarization is the presence of var...
research
10/14/2018

Robust Neural Abstractive Summarization Systems and Evaluation against Adversarial Information

Sequence-to-sequence (seq2seq) neural models have been actively investig...
research
04/15/2021

RefSum: Refactoring Neural Summarization

Although some recent works show potential complementarity among differen...
research
10/20/2017

A Semantically Motivated Approach to Compute ROUGE Scores

ROUGE is one of the first and most widely used evaluation metrics for te...
research
05/12/2023

What are the Desired Characteristics of Calibration Sets? Identifying Correlates on Long Form Scientific Summarization

Summarization models often generate text that is poorly calibrated to qu...

Please sign up or login with your details

Forgot password? Click here to reset