QRelScore: Better Evaluating Generated Questions with Deeper Understanding of Context-aware Relevance

04/29/2022
by   Xiaoqiang Wang, et al.
4

Existing metrics for assessing question generation not only require costly human reference but also fail to take into account the input context of generation, rendering the lack of deep understanding of the relevance between the generated questions and input contexts. As a result, they may wrongly penalize a legitimate and reasonable candidate question when it (i) involves complicated reasoning with the context or (ii) can be grounded by multiple evidences in the context. In this paper, we propose QRelScore, a context-aware Relevance evaluation metric for Question Generation. Based on off-the-shelf language models such as BERT and GPT2, QRelScore employs both word-level hierarchical matching and sentence-level prompt-based generation to cope with the complicated reasoning and diverse generation from multiple evidences, respectively. Compared with existing metrics, our experiments demonstrate that QRelScore is able to achieve a higher correlation with human judgments while being much more robust to adversarial samples.

READ FULL TEXT
research
08/19/2021

Language Model Augmented Relevance Score

Although automated metrics are commonly used to evaluate NLG systems, th...
research
11/02/2022

RQUGE: Reference-Free Metric for Evaluating Question Generation by Answering the Question

Existing metrics for evaluating the quality of automatically generated q...
research
05/26/2023

Evaluation of Question Generation Needs More References

Question generation (QG) is the task of generating a valid and fluent qu...
research
10/09/2022

QAScore – An Unsupervised Unreferenced Metric for the Question Generation Evaluation

Question Generation (QG) aims to automate the task of composing question...
research
10/25/2022

Learning to Reuse Distractors to support Multiple Choice Question Generation in Education

Multiple choice questions (MCQs) are widely used in digital learning sys...
research
10/12/2016

Question Generation from a Knowledge Base with Web Exploration

Question generation from a knowledge base (KB) is the task of generating...
research
05/20/2022

Low-cost Relevance Generation and Evaluation Metrics for Entity Resolution in AI

Entity Resolution (ER) in voice assistants is a prime component during r...

Please sign up or login with your details

Forgot password? Click here to reset