Syntactic Complexity Identification, Measurement, and Reduction Through Controlled Syntactic Simplification

04/16/2023
by   Muhammad Salman, et al.
0

Text simplification is one of the domains in Natural Language Processing (NLP) that offers an opportunity to understand the text in a simplified manner for exploration. However, it is always hard to understand and retrieve knowledge from unstructured text, which is usually in the form of compound and complex sentences. There are state-of-the-art neural network-based methods to simplify the sentences for improved readability while replacing words with plain English substitutes and summarising the sentences and paragraphs. In the Knowledge Graph (KG) creation process from unstructured text, summarising long sentences and substituting words is undesirable since this may lead to information loss. However, KG creation from text requires the extraction of all possible facts (triples) with the same mentions as in the text. In this work, we propose a controlled simplification based on the factual information in a sentence, i.e., triple. We present a classical syntactic dependency-based approach to split and rephrase a compound and complex sentence into a set of simplified sentences. This simplification process will retain the original wording with a simple structure of possible domain facts in each sentence, i.e., triples. The paper also introduces an algorithm to identify and measure a sentence's syntactic complexity (SC), followed by reduction through a controlled syntactic simplification process. Last, an experiment for a dataset re-annotation is also conducted through GPT3; we aim to publish this refined corpus as a resource. This work is accepted and presented in International workshop on Learning with Knowledge Graphs (IWLKG) at WSDM-2023 Conference. The code and data is available at www.github.com/sallmanm/SynSim.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/09/2013

Syntactic Analysis Based on Morphological Characteristic Features of the Romanian Language

This paper refers to the syntactic analysis of phrases in Romanian, as a...
research
08/14/2023

Can Knowledge Graphs Simplify Text?

Knowledge Graph (KG)-to-Text Generation has seen recent improvements in ...
research
09/29/2021

Multilingual Fact Linking

Knowledge-intensive NLP tasks can benefit from linking natural language ...
research
07/02/2022

Syntax Controlled Knowledge Graph-to-Text Generation with Order and Semantic Consistency

The knowledge graph (KG) stores a large amount of structural knowledge, ...
research
04/14/2023

SimpLex: a lexical text simplification architecture

Text simplification (TS) is the process of generating easy-to-understand...
research
03/23/2021

Annotation of Chinese Predicate Heads and Relevant Elements

A predicate head is a verbal expression that plays a role as the structu...
research
04/02/2022

Learning to Simplify with Data Hopelessly Out of Alignment

We consider whether it is possible to do text simplification without rel...

Please sign up or login with your details

Forgot password? Click here to reset