Cross-lingual Transfer Learning for Check-worthy Claim Identification over Twitter

11/09/2022
by   Maram Hasanain, et al.
0

Misinformation spread over social media has become an undeniable infodemic. However, not all spreading claims are made equal. If propagated, some claims can be destructive, not only on the individual level, but to organizations and even countries. Detecting claims that should be prioritized for fact-checking is considered the first step to fight against spread of fake news. With training data limited to a handful of languages, developing supervised models to tackle the problem over lower-resource languages is currently infeasible. Therefore, our work aims to investigate whether we can use existing datasets to train models for predicting worthiness of verification of claims in tweets in other languages. We present a systematic comparative study of six approaches for cross-lingual check-worthiness estimation across pairs of five diverse languages with the help of Multilingual BERT (mBERT) model. We run our experiments using a state-of-the-art multilingual Twitter dataset. Our results show that for some language pairs, zero-shot cross-lingual transfer is possible and can perform as good as monolingual models that are trained on the target language. We also show that in some languages, this approach outperforms (or at least is comparable to) state-of-the-art models.

READ FULL TEXT
research
09/27/2021

Rumour Detection via Zero-shot Cross-lingual Transfer Learning

Most rumour detection models for social media are designed for one speci...
research
02/14/2022

Matching Tweets With Applicable Fact-Checks Across Languages

An important challenge for news fact-checking is the effective dissemina...
research
12/16/2020

Multilingual Evidence Retrieval and Fact Verification to Combat Global Disinformation: The Power of Polyglotism

This article investigates multilingual evidence retrieval and fact verif...
research
10/28/2022

Stanceosaurus: Classifying Stance Towards Multilingual Misinformation

We present Stanceosaurus, a new corpus of 28,033 tweets in English, Hind...
research
01/13/2023

Multilingual Detection of Check-Worthy Claims using World Languages and Adapter Fusion

Check-worthiness detection is the task of identifying claims, worthy to ...
research
06/05/2020

Cross-lingual Transfer Learning for COVID-19 Outbreak Alignment

The spread of COVID-19 has become a significant and troubling aspect of ...
research
10/13/2021

Cross-lingual COVID-19 Fake News Detection

The COVID-19 pandemic poses a great threat to global public health. Mean...

Please sign up or login with your details

Forgot password? Click here to reset