ABCD: A Graph Framework to Convert Complex Sentences to a Covering Set of Simple Sentences

06/22/2021
by Yanjun Gao, et al.

Atomic clauses are fundamental text units for understanding complex sentences. Identifying the atomic sentences within complex sentences is important for applications such as summarization, argument mining, discourse analysis, discourse parsing, and question answering. Previous work relies mainly on rule-based methods that depend on parsing. We propose a new task, decomposing each complex sentence into simple sentences derived from the tensed clauses in the source, and formulate it as a graph edit task. Our neural model learns to Accept, Break, Copy, or Drop elements of a graph that combines word adjacency and grammatical dependencies. The full processing pipeline includes modules for graph construction, graph editing, and sentence generation from the output graph. We introduce DeSSE, a new dataset designed to train and evaluate complex sentence decomposition, and MinWiki, a subset of MinWikiSplit. On MinWiki, ABCD achieves performance comparable to two parsing baselines. On DeSSE, which has a more even balance of complex sentence types, our model predicts the number of atomic sentences more accurately than an encoder-decoder baseline. We also provide a detailed error analysis.
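To make the three pipeline stages concrete, here is a minimal toy sketch in Python of graph construction, graph editing, and sentence generation. It is not the authors' implementation: the edit labels are written by hand rather than predicted by a neural model, the dependency pairs are hand-written rather than produced by a parser, and the exact meaning given to each of the four edits (in particular which token Copy and Drop act on) is an assumption made for illustration. The names `Edit`, `build_edges`, and `apply_edits` are hypothetical.

```python
from enum import Enum


class Edit(Enum):
    ACCEPT = "A"  # assumed: keep the edge, both tokens stay in one sentence
    BREAK = "B"   # assumed: cut the edge
    COPY = "C"    # assumed: cut the edge, but copy the second token into the first token's sentence
    DROP = "D"    # assumed: delete the second token from the output


def build_edges(n_tokens, dependencies):
    """Graph construction: word-adjacency edges plus dependency edges."""
    edges = [(i, i + 1) for i in range(n_tokens - 1)]
    for head, dep in dependencies:
        # Skip dependency edges that duplicate an existing edge in either direction.
        if (head, dep) not in edges and (dep, head) not in edges:
            edges.append((head, dep))
    return edges


def apply_edits(tokens, edits):
    """Graph editing and sentence generation from the edited graph."""
    dropped = {b for (a, b), e in edits.items() if e is Edit.DROP}

    # Union-find over token indices; only ACCEPT edges merge components.
    parent = list(range(len(tokens)))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for (a, b), e in edits.items():
        if e is Edit.ACCEPT and a not in dropped and b not in dropped:
            parent[find(a)] = find(b)

    # Each connected component of surviving tokens becomes one simple sentence.
    groups = {}
    for i in range(len(tokens)):
        if i not in dropped:
            groups.setdefault(find(i), set()).add(i)

    # COPY duplicates the second token into the first token's component.
    for (a, b), e in edits.items():
        if e is Edit.COPY and a not in dropped and b not in dropped:
            groups[find(a)].add(b)

    # Linearize each component in original word order.
    ordered = sorted(groups.values(), key=lambda g: sorted(g))
    return [" ".join(tokens[i] for i in sorted(g)) for g in ordered]


tokens = ["She", "sang", "and", "danced"]
# Hand-written (head, dependent) pairs; (3, 0) assumes a shared-subject link.
dependencies = [(1, 0), (1, 3), (3, 2), (3, 0)]
edges = build_edges(len(tokens), dependencies)

# Hand-assigned labels standing in for model predictions.
edits = {
    (0, 1): Edit.ACCEPT,  # She - sang
    (1, 2): Edit.DROP,    # remove "and"
    (2, 3): Edit.BREAK,
    (1, 3): Edit.BREAK,   # separate the two clauses
    (3, 0): Edit.COPY,    # give "danced" its own copy of "She"
}
simple_sentences = apply_edits(tokens, edits)
print(simple_sentences)  # ['She sang', 'She danced']
```

In this toy reading, Break is what splits one graph into several, Copy restores an argument shared across clauses, and Drop removes connectives that belong to no simple sentence.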


