A Self-Training Method for Machine Reading Comprehension with Soft Evidence Extraction

05/11/2020
by   Yilin Niu, et al.
0

Neural models have achieved great success on machine reading comprehension (MRC), many of which typically consist of two components: an evidence extractor and an answer predictor. The former seeks the most relevant information from a reference text, while the latter is to locate or generate answers from the extracted evidence. Despite the importance of evidence labels for training the evidence extractor, they are not cheaply accessible, particularly in many non-extractive MRC tasks such as YES/NO question answering and multi-choice MRC. To address this problem, we present a Self-Training method (STM), which supervises the evidence extractor with auto-generated evidence labels in an iterative process. At each iteration, a base MRC model is trained with golden answers and noisy evidence labels. The trained model will predict pseudo evidence labels as extra supervision in the next iteration. We evaluate STM on seven datasets over three MRC tasks. Experimental results demonstrate the improvement on existing MRC models, and we also analyze how and why such a self-training method works in MRC.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/23/2019

Evidence Sentence Extraction for Machine Reading Comprehension

Recently remarkable success has been achieved in machine reading compreh...
research
05/10/2021

REPT: Bridging Language Models and Machine Reading Comprehension via Retrieval-Based Pre-training

Pre-trained Language Models (PLMs) have achieved great success on Machin...
research
08/21/2018

CoQA: A Conversational Question Answering Challenge

Humans gather information by engaging in conversations involving a serie...
research
10/06/2022

U3E: Unsupervised and Erasure-based Evidence Extraction for Machine Reading Comprehension

More tasks in Machine Reading Comprehension(MRC) require, in addition to...
research
09/19/2023

Benchmarks for Pirá 2.0, a Reading Comprehension Dataset about the Ocean, the Brazilian Coast, and Climate Change

Pirá is a reading comprehension dataset focused on the ocean, the Brazil...
research
08/18/2021

EviDR: Evidence-Emphasized Discrete Reasoning for Reasoning Machine Reading Comprehension

Reasoning machine reading comprehension (R-MRC) aims to answer complex q...
research
07/12/2016

Separating Answers from Queries for Neural Reading Comprehension

We present a novel neural architecture for answering queries, designed t...

Please sign up or login with your details

Forgot password? Click here to reset