Don't Say What You Don't Know: Improving the Consistency of Abstractive Summarization by Constraining Beam Search

03/16/2022
by   Daniel King, et al.
5

Abstractive summarization systems today produce fluent and relevant output, but often "hallucinate" statements not supported by the source text. We analyze the connection between hallucinations and training data, and find evidence that models hallucinate because they train on target summaries that are unsupported by the source. Based on our findings, we present PINOCCHIO, a new decoding method that improves the consistency of a transformer-based abstractive summarizer by constraining beam search to avoid hallucinations. Given the model states and outputs at a given step, PINOCCHIO detects likely model hallucinations based on various measures of attribution to the source text. PINOCCHIO backtracks to find more consistent output, and can opt to produce no summary at all when no consistent generation can be found. In experiments, we find that PINOCCHIO improves the consistency of generation (in terms of F1) by an average of 67

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/24/2020

Constrained Abstractive Summarization: Preserving Factual Consistency with Constrained Generation

Summaries generated by abstractive summarization are supposed to only co...
research
03/06/2023

Faithfulness-Aware Decoding Strategies for Abstractive Summarization

Despite significant progress in understanding and improving faithfulness...
research
02/05/2018

Diverse Beam Search for Increased Novelty in Abstractive Summarization

Text summarization condenses a text to a shorter version while retaining...
research
05/28/2022

Learning Non-Autoregressive Models from Search for Unsupervised Sentence Summarization

Text summarization aims to generate a short summary for an input text. I...
research
06/21/2019

Be Consistent! Improving Procedural Text Comprehension using Label Consistency

Our goal is procedural text comprehension, namely tracking how the prope...
research
09/08/2022

Applying Transformer-based Text Summarization for Keyphrase Generation

Keyphrases are crucial for searching and systematizing scholarly documen...
research
06/03/2021

Dissecting Generation Modes for Abstractive Summarization Models via Ablation and Attribution

Despite the prominence of neural abstractive summarization models, we kn...

Please sign up or login with your details

Forgot password? Click here to reset