Using Adversarial Attacks to Reveal the Statistical Bias in Machine Reading Comprehension Models

05/24/2021
by   Jieyu Lin, et al.
0

Pre-trained language models have achieved human-level performance on many Machine Reading Comprehension (MRC) tasks, but it remains unclear whether these models truly understand language or answer questions by exploiting statistical biases in datasets. Here, we demonstrate a simple yet effective method to attack MRC models and reveal the statistical biases in these models. We apply the method to the RACE dataset, for which the answer to each MRC question is selected from 4 options. It is found that several pre-trained language models, including BERT, ALBERT, and RoBERTa, show consistent preference to some options, even when these options are irrelevant to the question. When interfered by these irrelevant options, the performance of MRC models can be reduced from human-level performance to the chance-level performance. Human readers, however, are not clearly affected by these irrelevant options. Finally, we propose an augmented training method that can greatly reduce models' statistical biases.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/26/2020

Dual Multi-head Co-attention for Multi-choice Reading Comprehension

Multi-choice Machine Reading Comprehension (MRC) requires model to decid...
research
01/31/2023

The Impacts of Unanswerable Questions on the Robustness of Machine Reading Comprehension Models

Pretrained language models have achieved super-human performances on man...
research
06/23/2021

PALRACE: Reading Comprehension Dataset with Human Data and Labeled Rationales

Pre-trained language models achieves high performance on machine reading...
research
05/10/2021

ExpMRC: Explainability Evaluation for Machine Reading Comprehension

Achieving human-level performance on some of Machine Reading Comprehensi...
research
08/26/2021

Understanding Attention in Machine Reading Comprehension

Achieving human-level performance on some of Machine Reading Comprehensi...
research
10/22/2021

Challenges in Procedural Multimodal Machine Comprehension:A Novel Way To Benchmark

We focus on Multimodal Machine Reading Comprehension (M3C) where a model...
research
07/18/2022

MRCLens: an MRC Dataset Bias Detection Toolkit

Many recent neural models have shown remarkable empirical results in Mac...

Please sign up or login with your details

Forgot password? Click here to reset