AVeriTeC: A dataset for real-world claim verification with evidence from the web

05/22/2023
by   Michael Schlichtkrull, et al.
0

Existing datasets for automated fact-checking have substantial limitations, such as relying on artificial claims, lacking annotations for evidence and intermediate reasoning, or including evidence published after the claim. In this paper we introduce AVeriTeC, a new dataset of 4,568 real-world claims covering fact-checks by 50 different organizations. Each claim is annotated with question-answer pairs supported by evidence available online, as well as textual justifications explaining how the evidence combines to produce a verdict. Through a multi-round annotation process, we avoid common pitfalls including context dependence, evidence insufficiency, and temporal leakage, and reach a substantial inter-annotator agreement of κ=0.619 on verdicts. We develop a baseline as well as an evaluation scheme for verifying claims through several question-answering steps against the open web.

READ FULL TEXT

page 19

page 20

page 23

page 24

page 27

page 31

research
05/19/2023

Complex Claim Verification with Evidence Retrieved in the Wild

Evidence retrieval is a core part of automatic fact-checking. Prior work...
research
09/07/2022

Fact-Saboteurs: A Taxonomy of Evidence Manipulation Attacks against Fact-Verification Systems

Mis- and disinformation are now a substantial global threat to our secur...
research
09/01/2022

A Dataset for Detecting Real-World Environmental Claims

In this paper, we introduce an expert-annotated dataset for detecting re...
research
01/26/2022

CsFEVER and CTKFacts: Czech Datasets for Fact Verification

In this paper, we present two Czech datasets for automated fact-checking...
research
04/27/2021

A Knowledge Enhanced Learning and Semantic Composition Model for Multi-Claim Fact Checking

To inhibit the spread of rumorous information and its severe consequence...
research
05/31/2021

Zero-shot Fact Verification by Claim Generation

Neural models for automated fact verification have achieved promising re...
research
07/21/2023

MythQA: Query-Based Large-Scale Check-Worthy Claim Detection through Multi-Answer Open-Domain Question Answering

Check-worthy claim detection aims at providing plausible misinformation ...

Please sign up or login with your details

Forgot password? Click here to reset