Content Selection in Deep Learning Models of Summarization

10/29/2018
by   Chris Kedzie, et al.
0

We carry out experiments with deep learning models of summarization across the domains of news, personal stories, meetings, and medical articles in order to understand how content selection is performed. We find that many sophisticated features of state of the art extractive summarizers do not improve performance over simpler models. These results suggest that it is easier to create a summarizer for a new domain than previous work suggests and bring into question the benefit of deep learning models for summarization for those domains that do have massive datasets (i.e., news). At the same time, they suggest important questions for new research in summarization; namely, new forms of sentence representations or external knowledge sources are needed that are better suited to the summarization task.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/20/2020

Examining the State-of-the-Art in News Timeline Summarization

Previous work on automatic news timeline summarization (TLS) leaves an u...
research
10/01/2019

BillSum: A Corpus for Automatic Summarization of US Legislation

Automatic summarization methods have been studied on a variety of domain...
research
04/21/2021

Text Summarization of Czech News Articles Using Named Entities

The foundation for the research of summarization in the Czech language w...
research
02/01/2023

HunSum-1: an Abstractive Summarization Dataset for Hungarian

We introduce HunSum-1: a dataset for Hungarian abstractive summarization...
research
05/25/2020

Deep Learning Models for Automatic Summarization

Text summarization is an NLP task which aims to convert a textual docume...
research
04/03/2017

Combining Lexical and Syntactic Features for Detecting Content-dense Texts in News

Content-dense news report important factual information about an event i...

Please sign up or login with your details

Forgot password? Click here to reset