HULAT at SemEval-2023 Task 9: Data augmentation for pre-trained transformers applied to Multilingual Tweet Intimacy Analysis

02/24/2023
by   Isabel Segura-Bedmar, et al.
0

This paper describes our participation in SemEval-2023 Task 9, Intimacy Analysis of Multilingual Tweets. We fine-tune some of the most popular transformer models with the training dataset and synthetic data generated by different data augmentation techniques. During the development phase, our best results were obtained by using XLM-T. Data augmentation techniques provide a very slight improvement in the results. Our system ranked in the 27th position out of the 45 participating systems. Despite its modest results, our system shows promising results in languages such as Portuguese, English, and Dutch. All our code is available in the repository <https://github.com/isegura/hulat_intimacy>.

READ FULL TEXT
research
02/24/2023

HULAT at SemEval-2023 Task 10: Data augmentation for pre-trained transformers applied to the detection of sexism in social media

This paper describes our participation in SemEval-2023 Task 10, whose go...
research
11/03/2022

Exploring the State-of-the-Art Language Modeling Methods and Data Augmentation Techniques for Multilingual Clause-Level Morphology

This paper describes the KUIS-AI NLP team's submission for the 1^st Shar...
research
04/11/2022

HFL at SemEval-2022 Task 8: A Linguistics-inspired Regression Model with Data Augmentation for Multilingual News Similarity

This paper describes our system designed for SemEval-2022 Task 8: Multil...
research
05/23/2023

LLM-powered Data Augmentation for Enhanced Crosslingual Performance

This paper aims to explore the potential of leveraging Large Language Mo...
research
06/20/2022

Technical Report: Combining knowledge from Transfer Learning during training and Wide Resnets

In this report, we combine the idea of Wide ResNets and transfer learnin...
research
07/05/2023

PULSAR at MEDIQA-Sum 2023: Large Language Models Augmented by Synthetic Dialogue Convert Patient Dialogues to Medical Records

This paper describes PULSAR, our system submission at the ImageClef 2023...
research
05/31/2022

Multilingual Transformers for Product Matching – Experiments and a New Benchmark in Polish

Product matching corresponds to the task of matching identical products ...

Please sign up or login with your details

Forgot password? Click here to reset