Split and Rephrase

07/21/2017
by   Shashi Narayan, et al.
0

We propose a new sentence simplification task (Split-and-Rephrase) where the aim is to split a complex sentence into a meaning preserving sequence of shorter sentences. Like sentence simplification, splitting-and-rephrasing has the potential of benefiting both natural language processing and societal applications. Because shorter sentences are generally better processed by NLP systems, it could be used as a preprocessing step which facilitates and improves the performance of parsers, semantic role labellers and machine translation systems. It should also be of use for people with reading disabilities because it allows the conversion of longer sentences into shorter ones. This paper makes two contributions towards this new task. First, we create and make available a benchmark consisting of 1,066,115 tuples mapping a single complex sentence to a sequence of sentences expressing the same meaning. Second, we propose five models (vanilla sequence-to-sequence to semantically-motivated models) to understand the difficulty of the proposed task.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/10/2021

BiSECT: Learning to Split and Rephrase Sentences with Bitexts

An important task in NLP applications such as sentence simplification is...
research
10/25/2022

Revision for Concision: A Constrained Paraphrase Generation Task

Academic writing should be concise as concise sentences better keep the ...
research
05/02/2018

Split and Rephrase: Better Evaluation and a Stronger Baseline

Splitting and rephrasing a complex sentence into several shorter sentenc...
research
01/31/2023

Sentence Identification with BOS and EOS Label Combinations

The sentence is a fundamental unit in many NLP applications. Sentence se...
research
10/06/2020

COD3S: Diverse Generation with Discrete Semantic Signatures

We present COD3S, a novel method for generating semantically diverse sen...
research
09/26/2019

MinWikiSplit: A Sentence Splitting Corpus with Minimal Propositions

We compiled a new sentence splitting corpus that is composed of 203K pai...
research
01/16/2020

Fact-aware Sentence Split and Rephrase with Permutation Invariant Training

Sentence Split and Rephrase aims to break down a complex sentence into s...

Please sign up or login with your details

Forgot password? Click here to reset