EduQG: A Multi-format Multiple Choice Dataset for the Educational Domain

10/12/2022
by Amir Hadifar, et al.

We introduce a high-quality dataset that contains 3,397 samples comprising (i) multiple choice questions, (ii) answers (including distractors), and (iii) their source documents, from the educational domain. Each question is phrased in two forms, normal and cloze. Correct answers are linked to source documents with sentence-level annotations. Thus, our versatile dataset can be used for both question and distractor generation, as well as to explore new challenges such as question format conversion. Furthermore, 903 questions are accompanied by their cognitive complexity level as per Bloom's taxonomy. All questions have been generated by educational experts rather than crowd workers to ensure that they maintain educational and learning standards. Our analysis and experiments suggest clear differences between our dataset and those commonly used for educational question generation. We believe this new dataset can serve as a valuable resource for research and evaluation in the educational domain. The dataset and baselines will be released to support further research in question generation.

research
03/27/2022

Educational Question Generation of Children Storybooks via Question Type Distribution Learning and Event-Centric Summarization

Generating educational questions of fairytales or storybooks is vital fo...
research
07/19/2017

Crowdsourcing Multiple Choice Science Questions

We present a novel method for obtaining high-quality, domain-targeted mu...
research
04/30/2022

Towards Process-Oriented, Modular, and Versatile Question Generation that Meets Educational Needs

NLP-powered automatic question generation (QG) techniques carry great pe...
research
01/01/2023

Chatbots as Problem Solvers: Playing Twenty Questions with Role Reversals

New chat AI applications like ChatGPT offer an advanced understanding of...
research
12/07/2022

Pre-Training With Scientific Text Improves Educational Question Generation

With the boom of digital educational materials and scalable e-learning s...
research
06/21/2023

Towards Enriched Controllability for Educational Question Generation

Question Generation (QG) is a task within Natural Language Processing (N...
research
10/25/2022

Learning to Reuse Distractors to support Multiple Choice Question Generation in Education

Multiple choice questions (MCQs) are widely used in digital learning sys...
