Improving Human Text Comprehension through Semi-Markov CRF-based Neural Section Title Generation

04/15/2019
by Sebastian Gehrmann, et al.

Titles of short sections within long documents support readers by guiding their focus toward relevant passages and by providing anchor points that help them follow the progression of the document. The positive effects of section titles are even more pronounced for readers with less developed reading abilities, for example in communities with limited labeled text resources. We therefore aim to develop techniques for generating section titles in low-resource environments. In particular, we present an extractive pipeline for section title generation that first selects the most salient sentence and then applies deletion-based compression. Our compression approach is based on a Semi-Markov Conditional Random Field that leverages unsupervised word representations such as ELMo or BERT, eliminating the need for a complex encoder-decoder architecture. The results show that this approach performs competitively with sequence-to-sequence models in high-resource settings and strongly outperforms them in low-resource settings. In a human-subject study across subjects with varying reading abilities, we find that our section titles improve the speed of completing comprehension tasks while retaining similar accuracy.
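To make the compression step concrete, the following is a minimal sketch of Viterbi decoding for a semi-Markov model that labels whole token spans as KEEP or DELETE. It is not the authors' implementation: the function names, the per-token `keep_scores`, the additive segment scorer, and the transition penalty are all hypothetical stand-ins. In the approach described above, segment scores would come from features over contextual word representations such as ELMo or BERT, and would generally not decompose into per-token sums.

```python
from typing import Callable, List, Optional, Tuple

KEEP, DELETE = "KEEP", "DELETE"
LABELS = (KEEP, DELETE)
Segment = Tuple[int, int, str]  # (start, end_exclusive, label)


def semi_markov_viterbi(
    n: int,
    segment_score: Callable[[int, int, str], float],
    transition_score: Callable[[str, str], float],
    max_len: int,
) -> List[Segment]:
    """Best-scoring segmentation of tokens[0:n] into labeled spans of length <= max_len."""
    neg_inf = float("-inf")
    # best[j][y]: best score of any segmentation of tokens[:j] whose last span has label y
    best = [{y: neg_inf for y in LABELS} for _ in range(n + 1)]
    back: List[dict] = [{y: None for y in LABELS} for _ in range(n + 1)]
    for j in range(1, n + 1):
        for y in LABELS:
            for i in range(max(0, j - max_len), j):
                seg = segment_score(i, j, y)
                if i == 0:
                    # First span of the sentence: no incoming transition.
                    candidates: List[Tuple[float, Optional[str]]] = [(seg, None)]
                else:
                    candidates = [
                        (best[i][p] + transition_score(p, y) + seg, p) for p in LABELS
                    ]
                for score, prev in candidates:
                    if score > best[j][y]:
                        best[j][y] = score
                        back[j][y] = (i, prev)
    # Follow back-pointers from the end of the sentence.
    label: Optional[str] = max(LABELS, key=lambda y: best[n][y])
    segments: List[Segment] = []
    j = n
    while j > 0:
        i, prev = back[j][label]
        segments.append((i, j, label))
        j, label = i, prev
    segments.reverse()
    return segments


# Toy example. The scores are invented for illustration, not model output.
tokens = ["the", "committee", "finally", "approved", "the", "new", "budget", "yesterday"]
keep_scores = [1.0, 2.0, -1.5, 2.0, 0.5, -0.5, 2.0, -2.0]


def toy_segment_score(i: int, j: int, label: str) -> float:
    # Hypothetical scorer: positive token scores favor KEEP, negative favor DELETE.
    total = sum(keep_scores[i:j])
    return total if label == KEEP else -total


def toy_transition_score(prev: str, cur: str) -> float:
    # Small penalty for adjacent spans with the same label, so runs stay merged.
    return -0.5 if prev == cur else 0.0


segments = semi_markov_viterbi(len(tokens), toy_segment_score, toy_transition_score, max_len=3)
title = " ".join(tok for i, j, lab in segments if lab == KEEP for tok in tokens[i:j])
print(title)
```

Decoding over spans rather than single tokens lets the scorer judge a whole phrase at once, which is the property that distinguishes a semi-Markov model from a token-level CRF. The `max_len` bound keeps decoding cost linear in sentence length times the maximum span length.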


research
09/07/2019

Deleter: Leveraging BERT to Perform Unsupervised Successive Text Compression

Text compression has diverse applications such as Summarization, Reading...
research
09/08/2018

Generating Distractors for Reading Comprehension Questions from Real Examinations

We investigate the task of distractor generation for multiple choice rea...
research
10/26/2016

Broad Context Language Modeling as Reading Comprehension

Progress in text understanding has been driven by large datasets that te...
research
05/23/2022

BanglaNLG: Benchmarks and Resources for Evaluating Low-Resource Natural Language Generation in Bangla

This work presents BanglaNLG, a comprehensive benchmark for evaluating n...
research
04/12/2022

Generating Full Length Wikipedia Biographies: The Impact of Gender Bias on the Retrieval-Based Generation of Women Biographies

Generating factual, long-form text such as Wikipedia articles raises thr...
research
05/24/2019

Outline Generation: Understanding the Inherent Content Structure of Documents

In this paper, we introduce and tackle the Outline Generation (OG) task,...
