Team Hitachi at SemEval-2023 Task 3: Exploring Cross-lingual Multi-task Strategies for Genre and Framing Detection in Online News

03/03/2023
by   Yuta Koreeda, et al.
0

This paper explains the participation of team Hitachi to SemEval-2023 Task 3 "Detecting the genre, the framing, and the persuasion techniques in online news in a multi-lingual setup." Based on the multilingual, multi-task nature of the task and the setting that training data is limited, we investigated different strategies for training the pretrained language models under low resource settings. Through extensive experiments, we found that (a) cross-lingual/multi-task training, and (b) collecting an external balanced dataset, can benefit the genre and framing detection. We constructed ensemble models from the results and achieved the highest macro-averaged F1 scores in Italian and Russian genre categorization subtasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/10/2019

MultiFiT: Efficient Multi-lingual Language Model Fine-tuning

Pretrained language models are promising particularly for low-resource l...
research
12/22/2018

Exploiting Cross-Lingual Subword Similarities in Low-Resource Document Classification

Text classification must sometimes be applied in situations with no trai...
research
08/26/2022

Cross-lingual Transfer Learning for Fake News Detector in a Low-Resource Language

Development of methods to detect fake news (FN) in low-resource language...
research
06/07/2023

Allophant: Cross-lingual Phoneme Recognition with Articulatory Attributes

This paper proposes Allophant, a multilingual phoneme recognizer. It req...
research
04/27/2023

NAP at SemEval-2023 Task 3: Is Less Really More? (Back-)Translation as Data Augmentation Strategies for Detecting Persuasion Techniques

Persuasion techniques detection in news in a multi-lingual setup is non-...
research
11/29/2022

Compressing Cross-Lingual Multi-Task Models at Qualtrics

Experience management is an emerging business area where organizations f...
research
09/05/2023

Leveraging BERT Language Models for Multi-Lingual ESG Issue Identification

Environmental, Social, and Governance (ESG) has been used as a metric to...

Please sign up or login with your details

Forgot password? Click here to reset