Rumour Detection via Zero-shot Cross-lingual Transfer Learning

09/27/2021
by   Lin Tian, et al.
9

Most rumour detection models for social media are designed for one specific language (mostly English). There are over 40 languages on Twitter and most languages lack annotated resources to build rumour detection models. In this paper we propose a zero-shot cross-lingual transfer learning framework that can adapt a rumour detection model trained for a source language to another target language. Our framework utilises pretrained multilingual language models (e.g.multilingual BERT) and a self-training loop to iteratively bootstrap the creation of ”silver labels” in the target language to adapt the model from the source language to the target language. We evaluate our methodology on English and Chinese rumour datasets and demonstrate that our model substantially outperforms competitive benchmarks in both source and target language rumour detection.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/17/2021

Improving Zero-Shot Cross-Lingual Transfer Learning via Robust Training

In recent years, pre-trained multilingual language models, such as multi...
research
10/20/2021

Improved Multilingual Language Model Pretraining for Social Media Text via Translation Pair Prediction

We evaluate a simple approach to improving zero-shot multilingual transf...
research
04/19/2019

Zero-Shot Cross-Lingual Opinion Target Extraction

Aspect-based sentiment analysis involves the recognition of so called op...
research
11/09/2022

Cross-lingual Transfer Learning for Check-worthy Claim Identification over Twitter

Misinformation spread over social media has become an undeniable infodem...
research
03/18/2020

X-Stance: A Multilingual Multi-Target Dataset for Stance Detection

We extract a large-scale stance detection dataset from comments written ...
research
10/10/2019

Language Transfer for Early Warning of Epidemics from Social Media

Statements on social media can be analysed to identify individuals who a...
research
10/11/2020

Detecting Foodborne Illness Complaints in Multiple Languages Using English Annotations Only

Health departments have been deploying text classification systems for t...

Please sign up or login with your details

Forgot password? Click here to reset