A Comprehensive Dictionary and Term Variation Analysis for COVID-19 and SARS-CoV-2

10/27/2020
by   Robert Leaman, et al.
0

The number of unique terms in the scientific literature used to refer to either SARS-CoV-2 or COVID-19 is remarkably large and has continued to increase rapidly despite well-established standardized terms. This high degree of term variation makes high recall identification of these important entities difficult. In this manuscript we present an extensive dictionary of terms used in the literature to refer to SARS-CoV-2 and COVID-19. We use a rule-based approach to iteratively generate new term variants, then locate these variants in a large text corpus. We compare our dictionary to an extensive collection of terminological resources, demonstrating that our resource provides a substantial number of additional terms. We use our dictionary to analyze the usage of SARS-CoV-2 and COVID-19 terms over time and show that the number of unique terms continues to grow rapidly. Our dictionary is freely available at https://github.com/ncbi-nlp/CovidTermVar.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/16/2022

Comprehensive identification of Long Covid articles with human-in-the-loop machine learning

A significant percentage of COVID-19 survivors experience ongoing multis...
research
10/11/2021

COVID-Datathon: Biomarker identification for COVID-19 severity based on BALF scRNA-seq data

The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) emergen...
research
06/03/2020

Extracting COVID-19 Events from Twitter

We present a corpus of 7,500 tweets annotated with COVID-19 events, incl...
research
08/31/2016

A Dictionary-based Approach to Racism Detection in Dutch Social Media

We present a dictionary-based approach to racism detection in Dutch soci...
research
06/11/2020

COVID-19-CT-CXR: a freely accessible and weakly labeled chest X-ray and CT image collection on COVID-19 from biomedical literature

The latest threat to global health is the COVID-19 outbreak. Although th...
research
04/06/2020

Building a Norwegian Lexical Resource for Medical Entity Recognition

We present a large Norwegian lexical resource of categorized medical ter...
research
06/22/2022

Connecting a French Dictionary from the Beginning of the 20th Century to Wikidata

The Petit Larousse illustré is a French dictionary first published in 19...

Please sign up or login with your details

Forgot password? Click here to reset