Towards Understanding Omission in Dialogue Summarization

11/14/2022
by Yicheng Zou, et al.

Dialogue summarization aims to condense a lengthy dialogue into a concise summary, and has recently achieved significant progress. However, the results of existing methods are still far from satisfactory. Previous work has indicated that omission is a major factor affecting summarization quality, but few studies have explored the omission problem further, for example how omission affects summarization results and how to detect it, both of which are critical for reducing omission and improving summary quality. Moreover, analyzing and detecting omission relies on summarization datasets with omission labels (i.e., which dialogue utterances are omitted from the summary), and no such datasets are available in the current literature. In this paper, we propose the OLDS dataset, which provides high-quality Omission Labels for Dialogue Summarization. By analyzing this dataset, we find that summarization quality improves substantially when the summarization model is given ground-truth omission labels and can use them to recover the omitted information, which demonstrates the importance of omission detection for mitigating omission in dialogue summarization. We therefore formulate an omission detection task and demonstrate that our proposed dataset supports the training and evaluation of this task well. We also call for further research on omission detection based on our proposed dataset. Our dataset and code are publicly available.
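To make the notion of an omission label concrete, the following is a minimal illustrative sketch, not the OLDS format or the paper's evaluation code. It assumes that a label is the set of indices of dialogue utterances whose content is missing from a candidate summary, and that a detector is scored with utterance-level precision, recall, and F1. The names `OmissionExample` and `omission_prf`, the example dialogue, and the choice of metric are all hypothetical.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class OmissionExample:
    """Hypothetical record: a dialogue, a candidate summary, and omission labels."""
    utterances: list[str]
    candidate_summary: str
    omitted: set[int]  # indices of utterances assumed to be omitted from the summary


def omission_prf(predicted: set[int], gold: set[int]) -> tuple[float, float, float]:
    """Utterance-level precision, recall, and F1 of predicted omission labels.

    This metric is an assumption made for illustration only.
    """
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    denominator = precision + recall
    f1 = 2 * precision * recall / denominator if denominator else 0.0
    return precision, recall, f1


# Invented example data, not taken from any dataset.
example = OmissionExample(
    utterances=[
        "Amy: Are we still meeting tomorrow?",
        "Ben: Yes, at 10 am.",
        "Amy: Can you bring the slides?",
        "Ben: Sure, I will print them too.",
    ],
    candidate_summary="Amy and Ben will meet tomorrow at 10 am.",
    omitted={2, 3},  # the slides request and Ben's reply are missing from the summary
)

# A hypothetical detector flags utterances 1 and 2 as omitted.
precision, recall, f1 = omission_prf({1, 2}, example.omitted)
```

In this sketch the detector recovers one of the two omitted utterances and raises one false alarm, so precision and recall are both one half.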


