Matching Tweets With Applicable Fact-Checks Across Languages

02/14/2022
by Ashkan Kazemi, et al.

An important challenge for news fact-checking is the effective dissemination of existing fact-checks, which in turn creates a need for reliable methods to detect previously fact-checked claims. In this paper, we focus on automatically finding existing fact-checks for claims made in social media posts (tweets). We conduct both classification and retrieval experiments in monolingual (English only), multilingual (Spanish, Portuguese), and cross-lingual (Hindi-English) settings, using multilingual transformer models such as XLM-RoBERTa and multilingual embeddings such as LaBSE and SBERT. We present promising results for "match" classification (93 [...]) and also find that, in our monolingual experiments, a BM25 baseline outperforms state-of-the-art multilingual embedding models on the retrieval task. We highlight and discuss the NLP challenges that arise when addressing this problem in different languages, and we introduce a novel curated dataset of fact-checks and corresponding tweets for future research.
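To make the BM25 baseline mentioned in the abstract concrete, the following is a minimal sketch of BM25 ranking of fact-checks against a tweet. It is not the authors' implementation: the `BM25` class, the whitespace `tokenize` helper, the parameter values `k1=1.5` and `b=0.75` (common defaults, not taken from the paper), and the example texts are all assumptions made for illustration.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: naive lowercase whitespace tokenization, for illustration only.
    return text.lower().split()


class BM25:
    """Hypothetical minimal BM25 index over a small list of documents."""

    def __init__(self, docs, k1=1.5, b=0.75):
        # k1 and b are commonly used defaults, not values from the paper.
        self.k1 = k1
        self.b = b
        self.doc_tokens = [tokenize(d) for d in docs]
        self.doc_lens = [len(t) for t in self.doc_tokens]
        self.avg_len = sum(self.doc_lens) / len(docs)
        self.term_freqs = [Counter(t) for t in self.doc_tokens]
        # Document frequency: number of documents containing each term.
        df = Counter()
        for toks in self.doc_tokens:
            df.update(set(toks))
        n = len(docs)
        # Smoothed IDF variant that is always positive.
        self.idf = {t: math.log(1 + (n - f + 0.5) / (f + 0.5)) for t, f in df.items()}

    def score(self, query, i):
        # BM25 score of document i for the query.
        tf = self.term_freqs[i]
        norm = self.k1 * (1 - self.b + self.b * self.doc_lens[i] / self.avg_len)
        total = 0.0
        for term in tokenize(query):
            if term in tf:
                f = tf[term]
                total += self.idf[term] * f * (self.k1 + 1) / (f + norm)
        return total

    def rank(self, query):
        # Document indices sorted by descending score (ties broken by index).
        scored = [(self.score(query, i), i) for i in range(len(self.doc_tokens))]
        return [i for _, i in sorted(scored, key=lambda p: (-p[0], p[1]))]


# Invented example texts, not drawn from the paper's dataset.
fact_checks = [
    "No, COVID-19 vaccines do not contain microchips",
    "Viral photo of flooded airport is from 2017, not this week",
    "Claim that drinking hot water cures the flu is false",
]
tweet = "they are putting microchips in the vaccines"

index = BM25(fact_checks)
ranking = index.rank(tweet)
print(ranking)
```

Because BM25 relies on exact term overlap, a sketch like this only applies to the monolingual setting; matching a Hindi tweet to an English fact-check would need a shared multilingual representation instead.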


