UPB at IberLEF-2023 AuTexTification: Detection of Machine-Generated Text using Transformer Ensembles

08/02/2023
by   Andrei-Alexandru Preda, et al.
0

This paper describes the solutions submitted by the UPB team to the AuTexTification shared task, featured as part of IberLEF-2023. Our team participated in the first subtask, identifying text documents produced by large language models instead of humans. The organizers provided a bilingual dataset for this subtask, comprising English and Spanish texts covering multiple domains, such as legal texts, social media posts, and how-to articles. We experimented mostly with deep learning models based on Transformers, as well as training techniques such as multi-task learning and virtual adversarial training to obtain better results. We submitted three runs, two of which consisted of ensemble models. Our best-performing model achieved macro F1-scores of 66.63

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/26/2022

Transformer-based Model for Word Level Language Identification in Code-mixed Kannada-English Texts

Using code-mixed data in natural language processing (NLP) research curr...
research
09/19/2020

Aggressive Language Detection with Joint Text Normalization via Adversarial Multi-task Learning

Aggressive language detection (ALD), detecting the abusive and offensive...
research
09/07/2022

AILAB-Udine@SMM4H 22: Limits of Transformers and BERT Ensembles

This paper describes the models developed by the AILAB-Udine team for th...
research
02/05/2022

Multilingual Hate Speech and Offensive Content Detection using Modified Cross-entropy Loss

The number of increased social media users has led to a lot of people mi...
research
04/01/2022

Nowruz at SemEval-2022 Task 7: Tackling Cloze Tests with Transformers and Ordinal Regression

This paper outlines the system using which team Nowruz participated in S...
research
10/22/2019

Automatic Extraction of Personality from Text: Challenges and Opportunities

In this study, we examined the possibility to extract personality traits...
research
02/18/2022

AMS_ADRN at SemEval-2022 Task 5: A Suitable Image-text Multimodal Joint Modeling Method for Multi-task Misogyny Identification

Women are influential online, especially in image-based social media suc...

Please sign up or login with your details

Forgot password? Click here to reset