Improving Reading Comprehension Question Generation with Data Augmentation and Overgenerate-and-rank

06/15/2023
by   Nischal Ashok Kumar, et al.
0

Reading comprehension is a crucial skill in many aspects of education, including language learning, cognitive development, and fostering early literacy skills in children. Automated answer-aware reading comprehension question generation has significant potential to scale up learner support in educational activities. One key technical challenge in this setting is that there can be multiple questions, sometimes very different from each other, with the same answer; a trained question generation method may not necessarily know which question human educators would prefer. To address this challenge, we propose 1) a data augmentation method that enriches the training dataset with diverse questions given the same context and answer and 2) an overgenerate-and-rank method to select the best question from a pool of candidates. We evaluate our method on the FairytaleQA dataset, showing a 5 absolute improvement in ROUGE-L over the best existing method. We also demonstrate the effectiveness of our method in generating harder, "implicit" questions, where the answers are not contained in the context as text spans.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/14/2019

Learning to Ask Unanswerable Questions for Machine Reading Comprehension

Machine reading comprehension with unanswerable questions is a challengi...
research
10/04/2020

Tell Me How to Ask Again: Question Data Augmentation with Controllable Rewriting in Continuous Space

In this paper, we propose a novel data augmentation method, referred to ...
research
04/18/2021

Learning with Instance Bundles for Reading Comprehension

When training most modern reading comprehension models, all the question...
research
05/26/2023

GenQ: Automated Question Generation to Support Caregivers While Reading Stories with Children

When caregivers ask open–ended questions to motivate dialogue with child...
research
06/14/2017

Neural Models for Key Phrase Detection and Question Generation

We propose a two-stage neural model to tackle question generation from d...
research
10/20/2020

Bi-directional Cognitive Thinking Network for Machine Reading Comprehension

We propose a novel Bi-directional Cognitive Knowledge Framework (BCKF) f...
research
06/08/2021

Cheap and Good? Simple and Effective Data Augmentation for Low Resource Machine Reading

We propose a simple and effective strategy for data augmentation for low...

Please sign up or login with your details

Forgot password? Click here to reset