Selecting Data to Clean for Fact Checking: Minimizing Uncertainty vs. Maximizing Surprise

09/11/2019
by   Stavros Sintos, et al.
0

We study the optimization problem of selecting numerical quantities to clean in order to fact-check claims based on such data. Oftentimes, such claims are technically correct, but they can still mislead for two reasons. First, data may contain uncertainty and errors. Second, data can be "fished" to advance particular positions. In practice, fact-checkers cannot afford to clean all data and must choose to clean what "matters the most" to checking a claim. We explore alternative definitions of what "matters the most": one is to ascertain claim qualities (by minimizing uncertainty in these measures), while an alternative is just to counter the claim (by maximizing the probability of finding a counterargument). We show whether the two objectives align with each other, with important implications on when fact-checkers should exercise care in selective data cleaning, to avoid potential bias introduced by their desire to counter claims. We develop efficient algorithms for solving the various variants of the optimization problem, showing significant improvements over naive solutions. The problem is particularly challenging because the objectives in the fact-checking context are complex, non-linear functions over data. We obtain results that generalize to a large class of functions, with potential applications beyond fact-checking.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/25/2022

Missing Counter-Evidence Renders NLP Fact-Checking Unrealistic for Misinformation

Misinformation emerges in times of uncertainty when credible information...
research
09/20/2021

The Case for Claim Difficulty Assessment in Automatic Fact Checking

Fact-checking is the process (human, automated, or hybrid) by which clai...
research
09/22/2021

Scalable Fact-checking with Human-in-the-Loop

Researchers have been investigating automated solutions for fact-checkin...
research
04/15/2021

The Role of Context in Detecting Previously Fact-Checked Claims

Recent years have seen the proliferation of disinformation and misinform...
research
08/20/2020

Checkworthiness in Automatic Claim Detection Models: Definitions and Analysis of Datasets

Public, professional and academic interest in automated fact-checking ha...
research
08/04/2019

Automatic Fact-Checking Using Context and Discourse Information

We study the problem of automatic fact-checking, paying special attentio...
research
05/22/2023

LM vs LM: Detecting Factual Errors via Cross Examination

A prominent weakness of modern language models (LMs) is their tendency t...

Please sign up or login with your details

Forgot password? Click here to reset