Rapidly Bootstrapping a Question Answering Dataset for COVID-19

04/23/2020
by   Raphael Tang, et al.
7

We present CovidQA, the beginnings of a question answering dataset specifically designed for COVID-19, built by hand from knowledge gathered from Kaggle's COVID-19 Open Research Dataset Challenge. To our knowledge, this is the first publicly available resource of its type, and intended as a stopgap measure for guiding research until more substantial evaluation resources become available. While this dataset, comprising 124 question-article pairs as of the present version 0.1 release, does not have sufficient examples for supervised machine learning, we believe that it can be helpful for evaluating the zero-shot or transfer capabilities of existing models on topics specifically related to COVID-19. This paper describes our methodology for constructing the dataset and presents the effectiveness of a number of baselines, including term-based techniques and various transformer-based models. The dataset is available at http://covidqa.ai/

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/16/2021

Transformer-Based Models for Question Answering on COVID19

In response to the Kaggle's COVID-19 Open Research Dataset (CORD-19) cha...
research
09/14/2022

UIT-ViCoV19QA: A Dataset for COVID-19 Community-based Question Answering on Vietnamese Language

For the last two years, from 2020 to 2021, COVID-19 has broken disease p...
research
11/08/2022

COV19IR : COVID-19 Domain Literature Information Retrieval

Increasing number of COVID-19 research literatures cause new challenges ...
research
05/12/2021

Encoding Explanatory Knowledge for Zero-shot Science Question Answering

This paper describes N-XKT (Neural encoding based on eXplanatory Knowled...
research
07/02/2020

Project PIAF: Building a Native French Question-Answering Dataset

Motivated by the lack of data for non-English languages, in particular f...
research
08/19/2023

Breaking Language Barriers: A Question Answering Dataset for Hindi and Marathi

The recent advances in deep-learning have led to the development of high...
research
05/26/2020

What Are People Asking About COVID-19? A Question Classification Dataset

We present COVID-Q, a set of 1,690 questions about COVID-19 from 13 sour...

Please sign up or login with your details

Forgot password? Click here to reset