EntSUM: A Data Set for Entity-Centric Summarization

by Mounica Maddela, et al.

Controllable summarization aims to provide summaries that take into account user-specified aspects and preferences to better assist users with their information needs, as opposed to the standard summarization setup, which builds a single generic summary of a document. We introduce EntSUM, a human-annotated data set for controllable summarization with a focus on named entities as the aspects to control. We conduct an extensive quantitative analysis to motivate the task of entity-centric summarization and show that existing methods for controllable summarization fail to generate entity-centric summaries. We propose extensions to state-of-the-art summarization approaches that achieve substantially better results on our data set. Our analysis and results show the challenging nature of this task and of the proposed data set.



CTRLsum: Towards Generic Controllable Text Summarization

Current summarization systems yield generic summaries that are disconnec...

Controllable Abstractive Summarization

Current models for document summarization ignore user preferences such a...

Controllable Neural Dialogue Summarization with Personal Named Entity Planning

In this paper, we propose a controllable neural generation framework tha...

Topic-Aware Evaluation and Transformer Methods for Topic-Controllable Summarization

Topic-controllable summarization is an emerging research area with a wid...

ComSum: Commit Messages Summarization and Meaning Preservation

We present ComSum, a data set of 7 million commit messages for text summ...

Controllable Summarization with Constrained Markov Decision Process

We study controllable text summarization which allows users to gain cont...

Specificity-Based Sentence Ordering for Multi-Document Extractive Risk Summarization

Risk mining technologies seek to find relevant textual extractions that ...