A Qualitative Evaluation of Language Models on Automatic Question-Answering for COVID-19

06/19/2020
by   David Oniani, et al.
26

COVID-19 has resulted in an ongoing pandemic and as of 12 June 2020, has caused more than 7.4 million cases and over 418,000 deaths. The highly dynamic and rapidly evolving situation with COVID-19 has made it difficult to access accurate, on-demand information regarding the disease. Online communities, forums, and social media provide potential venues to search for relevant questions and answers, or post questions and seek answers from other members. However, due to the nature of such sites, there are always a limited number of relevant questions and responses to search from, and posted questions are rarely answered immediately. With the advancements in the field of natural language processing, particularly in the domain of language models, it has become possible to design chatbots that can automatically answer consumer questions. However, such models are rarely applied and evaluated in the healthcare domain, to meet the information needs with accurate and up-to-date healthcare data. In this paper, we propose to apply a language model for automatically answering questions related to COVID-19 and qualitatively evaluate the generated responses. We utilized the GPT-2 language model and applied transfer learning to retrain it on the COVID-19 Open Research Dataset (CORD-19) corpus. In order to improve the quality of the generated responses, we applied 4 different approaches, namely tf-idf, BERT, BioBERT, and USE to filter and retain relevant sentences in the responses. In the performance evaluation step, we asked two medical experts to rate the responses. We found that BERT and BioBERT, on average, outperform both tf-idf and USE in relevance-based sentence filtering tasks. Additionally, based on the chatbot, we created a user-friendly interactive web application to be hosted online.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/21/2019

How Good is Artificial Intelligence at Automatically Answering Consumer Questions Related to Alzheimer's Disease?

Alzheimer's Disease (AD) is the most common type of dementia, comprising...
research
06/23/2023

Retrieving Supporting Evidence for LLMs Generated Answers

Current large language models (LLMs) can exhibit near-human levels of pe...
research
11/04/2019

BAS: An Answer Selection Method Using BERT Language Model

In recent years, Question Answering systems have become more popular and...
research
07/22/2023

Psy-LLM: Scaling up Global Mental Health Psychological Services with AI-based Large Language Models

The demand for psychological counseling has grown significantly in recen...
research
04/06/2023

ChatGPT-Crawler: Find out if ChatGPT really knows what it's talking about

Large language models have gained considerable interest for their impres...
research
04/19/2022

Where Was COVID-19 First Discovered? Designing a Question-Answering System for Pandemic Situations

The COVID-19 pandemic is accompanied by a massive "infodemic" that makes...

Please sign up or login with your details

Forgot password? Click here to reset