Check-COVID: Fact-Checking COVID-19 News Claims with Scientific Evidence

05/29/2023
by   Gengyu Wang, et al.
0

We present a new fact-checking benchmark, Check-COVID, that requires systems to verify claims about COVID-19 from news using evidence from scientific articles. This approach to fact-checking is particularly challenging as it requires checking internet text written in everyday language against evidence from journal articles written in formal academic language. Check-COVID contains 1, 504 expert-annotated news claims about the coronavirus paired with sentence-level evidence from scientific journal articles and veracity labels. It includes both extracted (journalist-written) and composed (annotator-written) claims. Experiments using both a fact-checking specific system and GPT-3.5, which respectively achieve F1 scores of 76.99 and 69.90 on this task, reveal the difficulty of automatically fact-checking both claim types and the importance of in-domain data for good performance. Our data and models are released publicly at https://github.com/posuer/Check-COVID.

READ FULL TEXT
research
04/30/2020

Fact or Fiction: Verifying Scientific Claims

We introduce the task of scientific fact-checking. Given a corpus of sci...
research
10/25/2021

SciClops: Detecting and Contextualizing Scientific Claims for Assisting Manual Fact-Checking

This paper describes SciClops, a method to help combat online scientific...
research
06/07/2021

COVID-Fact: Fact Extraction and Verification of Real-World Claims on COVID-19 Pandemic

We introduce a FEVER-like dataset COVID-Fact of 4,086 claims concerning ...
research
06/08/2020

Misinformation has High Perplexity

Debunking misinformation is an important and time-critical task as there...
research
11/02/2021

Assessing Effectiveness of Using Internal Signals for Check-Worthy Claim Identification in Unlabeled Data for Automated Fact-Checking

While recent work on automated fact-checking has focused mainly on verif...
research
10/27/2021

FacTeR-Check: Semi-automated fact-checking through Semantic Similarity and Natural Language Inference

Our society produces and shares overwhelming amounts of information thro...
research
04/29/2020

A Benchmark Dataset of Check-worthy Factual Claims

In this paper we present the ClaimBuster dataset of 23,533 statements ex...

Please sign up or login with your details

Forgot password? Click here to reset