DisSim: A Discourse-Aware Syntactic Text Simplification Frameworkfor English and German

09/26/2019
by   Christina Niklaus, et al.
0

We introduce DisSim, a discourse-aware sentence splitting framework for English and German whose goal is to transform syntactically complex sentences into an intermediate representation that presents a simple and more regular structure which is easier to process for downstream semantic applications. For this purpose, we turn input sentences into a two-layered semantic hierarchy in the form of core facts and accompanying contexts, while identifying the rhetorical relations that hold between them. In that way, we preserve the coherence structure of the input and, hence, its interpretability for downstream tasks.

READ FULL TEXT
research
08/28/2018

Graphene: A Context-Preserving Open Information Extraction System

We introduce Graphene, an Open IE system whose goal is to generate accur...
research
08/01/2023

Discourse-Aware Text Simplification: From Complex Sentences to Linked Propositions

Sentences that present a complex syntax act as a major stumbling block f...
research
05/24/2021

Context-Preserving Text Simplification

We present a context-preserving text simplification (TS) approach that r...
research
06/03/2019

Transforming Complex Sentences into a Semantic Hierarchy

We present an approach for recursively splitting and rephrasing complex ...
research
09/26/2019

MinWikiSplit: A Sentence Splitting Corpus with Minimal Propositions

We compiled a new sentence splitting corpus that is composed of 203K pai...
research
01/17/2023

Learning a Formality-Aware Japanese Sentence Representation

While the way intermediate representations are generated in encoder-deco...
research
09/06/2019

Efficient Sentence Embedding using Discrete Cosine Transform

Vector averaging remains one of the most popular sentence embedding meth...

Please sign up or login with your details

Forgot password? Click here to reset