Context-Aware Document Simplification

05/10/2023
by   Liam Cripwell, et al.
0

To date, most work on text simplification has focused on sentence-level inputs. Early attempts at document simplification merely applied these approaches iteratively over the sentences of a document. However, this fails to coherently preserve the discourse structure, leading to suboptimal output quality. Recently, strategies from controllable simplification have been leveraged to achieve state-of-the-art results on document simplification by first generating a document-level plan (a sequence of sentence-level simplification operations) and using this plan to guide sentence-level simplification downstream. However, this is still limited in that the simplification model has no direct access to the local inter-sentence document context, likely having a negative impact on surface realisation. We explore various systems that use document context within the simplification process itself, either by iterating over larger text units or by extending the system architecture to attend over a high-level representation of document context. In doing so, we achieve state-of-the-art performance on the document simplification task, even when not relying on plan-guidance. Further, we investigate the performance and efficiency tradeoffs of system variants and make suggestions of when each should be preferred.

READ FULL TEXT
research
05/10/2021

DocOIE: A Document-level Context-Aware Dataset for OpenIE

Open Information Extraction (OpenIE) aims to extract structured relation...
research
10/08/2018

Improving the Transformer Translation Model with Document-Level Context

Although the Transformer translation model (Vaswani et al., 2017) has ac...
research
03/28/2020

HIN: Hierarchical Inference Network for Document-Level Relation Extraction

Document-level RE requires reading, inferring and aggregating over multi...
research
09/15/2021

Towards Document-Level Paraphrase Generation with Sentence Rewriting and Reordering

Paraphrase generation is an important task in natural language processin...
research
02/19/2020

Toward Making the Most of Context in Neural Machine Translation

Document-level machine translation manages to outperform sentence level ...
research
06/07/2021

Diverse Pretrained Context Encodings Improve Document Translation

We propose a new architecture for adapting a sentence-level sequence-to-...
research
08/16/2019

Bidirectional Context-Aware Hierarchical Attention Network for Document Understanding

The Hierarchical Attention Network (HAN) has made great strides, but it ...

Please sign up or login with your details

Forgot password? Click here to reset