AbLit: A Resource for Analyzing and Generating Abridged Versions of English Literature

02/13/2023
by   Melissa Roemmele, et al.
0

Creating an abridged version of a text involves shortening it while maintaining its linguistic qualities. In this paper, we examine this task from an NLP perspective for the first time. We present a new resource, AbLit, which is derived from abridged versions of English literature books. The dataset captures passage-level alignments between the original and abridged texts. We characterize the linguistic relations of these alignments, and create automated models to predict these relations as well as to generate abridgements for new texts. Our findings establish abridgement as a challenging task, motivating future resources and research. The dataset is available at github.com/roemmele/AbLit.

READ FULL TEXT

page 8

page 12

page 13

page 16

page 17

research
08/23/2023

Graecia capta ferum victorem cepit. Detecting Latin Allusions to Ancient Greek Literature

Intertextual allusions hold a pivotal role in Classical Philology, with ...
research
05/21/2023

Multilingual Simplification of Medical Texts

Automated text simplification aims to produce simple versions of complex...
research
06/01/2019

"President Vows to Cut <Taxes> Hair": Dataset and Analysis of Creative Text Editing for Humorous Headlines

We introduce, release, and analyze a new dataset, called Humicroedit, fo...
research
04/19/2022

I still have Time(s): Extending HeidelTime for German Texts

HeidelTime is one of the most widespread and successful tools for detect...
research
11/20/2019

Paraphrasing Verbs for Noun Compound Interpretation

An important challenge for the automatic analysis of English written tex...
research
04/07/2023

BenCoref: A Multi-Domain Dataset of Nominal Phrases and Pronominal Reference Annotations

Coreference Resolution is a well studied problem in NLP. While widely st...
research
01/08/2023

Traditional Readability Formulas Compared for English

Traditional English readability formulas, or equations, were largely dev...

Please sign up or login with your details

Forgot password? Click here to reset