Text Simplification for Comprehension-based Question-Answering

09/28/2021
by   Tanvi Dadu, et al.
12

Text simplification is the process of splitting and rephrasing a sentence to a sequence of sentences making it easier to read and understand while preserving the content and approximating the original meaning. Text simplification has been exploited in NLP applications like machine translation, summarization, semantic role labeling, and information extraction, opening a broad avenue for its exploitation in comprehension-based question-answering downstream tasks. In this work, we investigate the effect of text simplification in the task of question-answering using a comprehension context. We release Simple-SQuAD, a simplified version of the widely-used SQuAD dataset. Firstly, we outline each step in the dataset creation pipeline, including style transfer, thresholding of sentences showing correct transfer, and offset finding for each answer. Secondly, we verify the quality of the transferred sentences through various methodologies involving both automated and human evaluation. Thirdly, we benchmark the newly created corpus and perform an ablation study for examining the effect of the simplification process in the SQuAD-based question answering task. Our experiments show that simplification leads to up to 2.04 Finally, we conclude with an analysis of the transfer process, investigating the types of edits made by the model, and the effect of sentence length on the transfer model.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/28/2023

HaVQA: A Dataset for Visual Question Answering and Multimodal Research in Hausa Language

This paper presents HaVQA, the first multimodal dataset for visual quest...
research
02/09/2021

Decontextualization: Making Sentences Stand-Alone

Models for question answering, dialogue agents, and summarization often ...
research
11/01/2021

Discourse Comprehension: A Question Answering Framework to Represent Sentence Connections

While there has been substantial progress in text comprehension through ...
research
01/14/2021

TSQA: Tabular Scenario Based Question Answering

Scenario-based question answering (SQA) has attracted an increasing rese...
research
06/17/2023

Persian Semantic Role Labeling Using Transfer Learning and BERT-Based Models

Semantic role labeling (SRL) is the process of detecting the predicate-a...
research
07/16/2023

A Neural-Symbolic Approach Towards Identifying Grammatically Correct Sentences

Textual content around us is growing on a daily basis. Numerous articles...
research
03/05/2020

Talking-Heads Attention

We introduce "talking-heads attention" - a variation on multi-head atten...

Please sign up or login with your details

Forgot password? Click here to reset