CORD-19: The COVID-19 Open Research Dataset

04/22/2020
by Lucy Lu Wang, et al.

The COVID-19 Open Research Dataset (CORD-19) is a growing resource of scientific papers on COVID-19 and related historical coronavirus research. CORD-19 is designed to facilitate the development of text mining and information retrieval systems over its rich collection of metadata and structured full-text papers. Since its release, CORD-19 has been downloaded over 200K times and has served as the basis of many COVID-19 text mining and discovery systems. In this article, we describe the mechanics of dataset construction, highlighting challenges and key design decisions; provide an overview of how CORD-19 has been used; and describe several shared tasks built around the dataset. We hope this resource will continue to bring together the computing community, biomedical experts, and policymakers in the search for effective treatments and management policies for COVID-19.

research
02/06/2022

A Summary of COVID-19 Datasets

This research presents a review of main datasets that are developed for ...
research
02/10/2021

Accelerating COVID-19 research with graph mining and transformer-based learning

In 2020, the White House released the, "Call to Action to the Tech Commu...
research
11/08/2022

COV19IR : COVID-19 Domain Literature Information Retrieval

Increasing number of COVID-19 research literatures cause new challenges ...
research
07/22/2021

Reproducibility of COVID-19 pre-prints

To examine the reproducibility of COVID-19 research, we create a dataset...
research
04/06/2020

Discovering associations in COVID-19 related research papers

A COVID-19 pandemic has already proven itself to be a global challenge. ...
research
04/10/2020

Rapidly Deploying a Neural Search Engine for the COVID-19 Open Research Dataset: Preliminary Thoughts and Lessons Learned

We present the Neural Covidex, a search engine that exploits the latest ...
research
09/01/2022

A large dataset of software mentions in the biomedical literature

We describe the CZ Software Mentions dataset, a new dataset of software ...
