CALCS 2021 Shared Task: Machine Translation for Code-Switched Data

02/19/2022
by   Shuguang Chen, et al.
2

To date, efforts in the code-switching literature have focused for the most part on language identification, POS, NER, and syntactic parsing. In this paper, we address machine translation for code-switched social media data. We create a community shared task. We provide two modalities for participation: supervised and unsupervised. For the supervised setting, participants are challenged to translate English into Hindi-English (Eng-Hinglish) in a single direction. For the unsupervised setting, we provide the following language pairs: English and Spanish-English (Eng-Spanglish), and English and Modern Standard Arabic-Egyptian Arabic (Eng-MSAEA) in both directions. We share insights and challenges in curating the "into" code-switching language evaluation data. Further, we provide baselines for all language pairs in the shared task. The leaderboard for the shared task comprises 12 individual system submissions corresponding to 5 different teams. The best performance achieved is 12.67 English.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/28/2021

Investigating Code-Mixed Modern Standard Arabic-Egyptian to English Machine Translation

Recent progress in neural machine translation (NMT) has made it possible...
research
09/12/2023

Overview of GUA-SPA at IberLEF 2023: Guarani-Spanish Code Switching Analysis

We present the first shared task for detecting and analyzing code-switch...
research
09/28/2019

Overview for the Second Shared Task on Language Identification in Code-Switched Data

We present an overview of the second shared task on language identificat...
research
07/15/2019

Naver Labs Europe's Systems for the WMT19 Machine Translation Robustness Task

This paper describes the systems that we submitted to the WMT19 Machine ...
research
10/20/2022

The University of Edinburgh's Submission to the WMT22 Code-Mixing Shared Task (MixMT)

The University of Edinburgh participated in the WMT22 shared task on cod...
research
08/16/2019

UDS--DFKI Submission to the WMT2019 Similar Language Translation Shared Task

In this paper we present the UDS-DFKI system submitted to the Similar La...
research
04/16/2018

Universal Dependency Parsing for Hindi-English Code-switching

Code-switching is a phenomenon of mixing grammatical structures of two o...

Please sign up or login with your details

Forgot password? Click here to reset