NarraSum: A Large-Scale Dataset for Abstractive Narrative Summarization

12/02/2022
by Chao Zhao, et al.

Narrative summarization aims to produce a distilled version of a narrative to describe its most salient events and characters. Summarizing a narrative is challenging as it requires an understanding of event causality and character behaviors. To encourage research in this direction, we propose NarraSum, a large-scale narrative summarization dataset. It contains 122K narrative documents, which are collected from plot descriptions of movies and TV episodes with diverse genres, and their corresponding abstractive summaries. Experiments show that there is a large performance gap between humans and the state-of-the-art summarization models on NarraSum. We hope that this dataset will promote future research in summarization, as well as broader studies of natural language understanding and generation. The dataset is available at https://github.com/zhaochaocs/narrasum.
