DeepAI AI Chat
Log In Sign Up

Overview for the Second Shared Task on Language Identification in Code-Switched Data

by   Giovanni Molina, et al.
George Washington University
University of Houston

We present an overview of the second shared task on language identification in code-switched data. For the shared task, we had code-switched data from two different language pairs: Modern Standard Arabic-Dialectal Arabic (MSA-DA) and Spanish-English (SPA-ENG). We had a total of nine participating teams, with all teams submitting a system for SPA-ENG and four submitting for MSA-DA. Through evaluation, we found that once again language identification is more difficult for the language pair that is more closely related. We also found that this year's systems performed better overall than the systems from the previous shared task indicating overall progress in the state of the art for this task.


page 1

page 2

page 3

page 4


NADI 2021: The Second Nuanced Arabic Dialect Identification Shared Task

We present the findings and results of the Second Nuanced Arabic Dialect...

ACTI at EVALITA 2023: Overview of the Conspiracy Theory Identification Task

Conspiracy Theory Identication task is a new shared task proposed for th...

CALCS 2021 Shared Task: Machine Translation for Code-Switched Data

To date, efforts in the code-switching literature have focused for the m...

CCKS 2019 Shared Task on Inter-Personal Relationship Extraction

The CCKS2019 shared task was devoted to inter-personal relationship extr...

NADI 2022: The Third Nuanced Arabic Dialect Identification Shared Task

We describe findings of the third Nuanced Arabic Dialect Identification ...

NADI 2020: The First Nuanced Arabic Dialect Identification Shared Task

We present the results and findings of the First Nuanced Arabic Dialect ...

UnibucKernel Reloaded: First Place in Arabic Dialect Identification for the Second Year in a Row

We present a machine learning approach that ranked on the first place in...