Feeding What You Need by Understanding What You Learned

by   Xiaoqiang Wang, et al.

Machine Reading Comprehension (MRC) reveals the ability to understand a given text passage and answer questions based on it. Existing research works in MRC rely heavily on large-size models and corpus to improve the performance evaluated by metrics such as Exact Match (EM) and F_1. However, such a paradigm lacks sufficient interpretation to model capability and can not efficiently train a model with a large corpus. In this paper, we argue that a deep understanding of model capabilities and data properties can help us feed a model with appropriate training data based on its learning status. Specifically, we design an MRC capability assessment framework that assesses model capabilities in an explainable and multi-dimensional manner. Based on it, we further uncover and disentangle the connections between various data properties and model performance. Finally, to verify the effectiveness of the proposed MRC capability assessment framework, we incorporate it into a curriculum learning pipeline and devise a Capability Boundary Breakthrough Curriculum (CBBC) strategy, which performs a model capability-based training to maximize the data value and improve training efficiency. Extensive experiments demonstrate that our approach significantly improves performance, achieving up to an 11.22


page 2

page 5

page 15


LXPER Index 2.0: Improving Text Readability Assessment for L2 English Learners in South Korea

Most text readability assessment models are developed for the native rea...

LXPER Index: a curriculum-specific text readability assessment model for EFL students in Korea

Automatic readability assessment is one of the most important applicatio...

Simple and Effective Curriculum Pointer-Generator Networks for Reading Comprehension over Long Narratives

This paper tackles the problem of reading comprehension over long narrat...

Dialogue Response Selection with Hierarchical Curriculum Learning

We study the learning of a matching model for dialogue response selectio...

Prerequisites for Explainable Machine Reading Comprehension: A Position Paper

Machine reading comprehension (MRC) has received considerable attention ...

Cheap and Good? Simple and Effective Data Augmentation for Low Resource Machine Reading

We propose a simple and effective strategy for data augmentation for low...

Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum

Augmenting large language models (LLMs) with external tools has emerged ...

Please sign up or login with your details

Forgot password? Click here to reset