BIOMRC: A Dataset for Biomedical Machine Reading Comprehension

05/13/2020
by   Petros Stavropoulos, et al.
0

We introduce BIOMRC, a large-scale cloze-style biomedical MRC dataset. Care was taken to reduce noise, compared to the previous BIOREAD dataset of Pappas et al. (2018). Experiments show that simple heuristics do not perform well on the new dataset, and that two neural MRC models that had been tested on BIOREAD perform much better on BIOMRC, indicating that the new dataset is indeed less noisy or at least that its task is more feasible. Non-expert human performance is also higher on the new dataset compared to BIOREAD, and biomedical experts perform even better. We also introduce a new BERT-based MRC model, the best version of which substantially outperforms all other methods tested, reaching or surpassing the accuracy of biomedical experts in some experiments. We make the new dataset available in three different sizes, also releasing our code, and providing a leaderboard.

READ FULL TEXT
research
04/13/2022

A Distant Supervision Corpus for Extracting Biomedical Relationships Between Chemicals, Diseases and Genes

We introduce ChemDisGene, a new dataset for training and evaluating mult...
research
09/28/2021

Single-dataset Experts for Multi-dataset Question Answering

Many datasets have been created for training reading comprehension model...
research
07/01/2020

DocVQA: A Dataset for VQA on Document Images

We present a new dataset for Visual Question Answering on document image...
research
07/26/2021

Image-Based Parking Space Occupancy Classification: Dataset and Baseline

We introduce a new dataset for image-based parking space occupancy class...
research
01/08/2022

Image-based Automatic Dial Meter Reading in Unconstrained Scenarios

The replacement of analog meters with smart meters is costly, laborious,...
research
02/26/2022

BioADAPT-MRC: Adversarial Learning-based Domain Adaptation Improves Biomedical Machine Reading Comprehension Task

Motivation: Biomedical machine reading comprehension (biomedical-MRC) ai...
research
10/04/2022

Modular Approach to Machine Reading Comprehension: Mixture of Task-Aware Experts

In this work we present a Mixture of Task-Aware Experts Network for Mach...

Please sign up or login with your details

Forgot password? Click here to reset